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ABSTRACT 

We use a matched filter algorithm to find and study clusters in both N-body simulations artificially 
populated with galaxies and the 2MASS survey. In addition to numerous checks of the matched filter 
algorithm, we present results on the halo multiplicity function and the cluster number function. For a 
subset of our identified clusters we have information on X-ray temperatures and luminosities which we 
cross-correlate with optical richness and galaxy velocity dispersions. With all quantities normalized by 
the spherical radius corresponding to a mass overdensity of A a/ = 200 or the equivalent galaxy number 
overdensity of A^v = 200Q^ ~ 666, we find that the number of L > L* galaxies in a cluster of mass 
M 20 o is 

log N, 666 = (1.44 ± 0.17) + (1.10 ± 0.09) log(M 200 Vl0 15 M Q ) 

where the uncertainties are dominated by the scatter created by three choices for relating the observed 
quantities to the cluster mass. The region inside the virial radius has a K-band cluster mass-to-light ratio 
of (M/L)k = (116 ± 46)/i which is essentially independent of cluster mass. Integrating over all clusters 
more massive than M200 = 10 h M®, the virialized regions of clusters contain ~ 7% of the local 
stellar luminosity, quite comparable to the mass fraction in such objects in currently popular ACDM 
models. 

Subject headings: cosmology: theory - large-scale structure of Universe 



1. INTRODUCTION 

Clusters of galaxies have become one of our most impor- 
tant cosmological probes because they are relatively easy 
to discover yet have physical properties and abundances 
that are very sensitive to our model for the formation and 
evolution of structure in the universe. Of particular inter- 
est to us, a cluster sample provides the means to study the 
high-mass end of the halo multiplicity function, the aver- 
age number of galaxies in a halo of mass M, which provides 
important insight into the process of galaxy formation. 

In this paper we have two objectives. First, we will 
demonstrate the use of a matched filter approach to find- 
ing clusters in a redshift survey using both synthetic cata- 
logs and a large s ampl e of galaxies fr om th e 2MASS survey 
(Skrutskie et al. |l997j , Jarrett et al. |2000| ). With the syn- 
thetic data we can test the algorithm and our ability to ex- 
tract the input halo multiplicity function from the output 
cluster catalog. Second, we will determine the halo mul- 
tiplicity function, N(M), from the 2MASS survey. This 
study is the first phase in a bootstrapping process - based 
on a synthetic catalog known to have problems matching 
reality in detail we can calibrate an algorithm which when 
applied to the real data can supply the parameters for an 
improved model of the data. Then the improved model 
can be used to improve the algorithm and so on. 

In §^ we describe the synthetic and real 2MASS data. 
In §|| we describe our version of the matched filter algo- 
rithm. In sQwe test the algorithm on the synthetic catalog, 
focusing on our ability to determine the halo multiplicity 



function. In §|5| we apply the algorithm to the 2MASS sam- 
ple. Finally, in §^ we discuss the steps which can improve 
both the synthetic catalog and the algorithm. 

2. DATA: REAL AND SYNTHETIC 

We searched for clust ers us ing galaxies in the 2MASS 
survey (Skrutskie et al. 1997 ) and in simulations of the 
2MASS survey. We have chosen the 2MASS survey be- 
cause of its uniform photometry and large areal coverage, 
which makes it ideal for finding the relatively rare rich clus- 
ters in the local neighborhood where a wealth of auxiliary 
information is available. 

2.1. 2MASS Galaxies 

The 2MASS survey provides J, H, K photometry of 
galaxies over the full sky to a limiti ng magnitude of 
K < 13.75 mag (Jarrett et al. | 2p00 |). Based on the 



Schlegel, Finkbeiner & Davis ( 1998 ) Galactic extinction 
model and the available 2MASS catalogs, we selected all 
galaxies with Galactic extinction-corrected magnitudes of 
K < 12.25 magQ and Galactic latitude |6| > 5°. The final 
sample contains a total of 90989 galaxies distributed over 
91% of the sky. The redshift measurements are 89% com- 
plete for K < 11.25 mag but only 36% complete between 
11.25 < K < 12.25. The galaxies with redshifts are not a 
random sample of all of the galaxies - redshifts are more 
likely to be of cluster than field galaxies. 

1 We use the 20 mag/arcsec 2 circular isophotal magnitudes 
throughout the paper. 
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With the data corrected for Galactic extinction, the re- 
lation between apparent and absolute magnitudes is 



M. 



K 



K-5log(D L (z)/r )-k(z) = K-V(z) (1) 



where ro — 10 pc, Dl(z) is the luminosity distance to 
redshift z and k(z) = — 61og(l + z) is the fc-correction. 
The /c-correction is negative, independent of galaxy type , 
and valid for z <, 0.25 (based on the Worthey (1994) 
models). To simplify our later notation, we introduce 
V{z) — 51og(Z?L(z)/r ) + k(z) as the effective distance 
modulus to the galaxy. We use an fi ma t = 1 cosmological 
model for the distances and assume a Hubble constant of 
Ho = lOO/i km/s Mpc. For our local sample the particular 
cosmological model is un important. 

Kochanek et al. (2001) derived the K-band luminosity 
function for a subset of the 2MASS sample at comparable 
magnitudes. The luminosity function is described by a 
Schechter function, 
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where Mr — Mk* — 2.5 log(L/L*) and the integrated lu- 
minosity function is 
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cj){M)dM = n*r[l + a,L/L, 



(3) 



The parameters for the global luminosity function are n* = 
(1.16±0.10) x 10- 2 /i 3 /Mpc 3 , a = -1.09±0.06 and M x * = 
-23.39 ±0.05 using the 2MASS 20 mag/arcsec 2 isophotal 
magnitudes. This luminosi ty fu nction is consistent with 
that derived by Cole et al. (2001). The total luminosity of 
a galaxy is approximately 1.20 ± 0.04 times the isophotal 
luminosity. 

We should keep in mind that the early-type and late- 
type galaxies have different luminosity functions, with the 
early-type galaxies being brighter but less common than 
the late-type galaxies. The algorithm for finding clusters 
requires both a field luminosity function (f>f(M) and a clus- 
ter luminosity function </> c (M) which will be different be- 
cause of the morphology-density relation (e.g. Dressier 1980) 
In our present study we assume that we can model both 
using the global luminosity function, <f>f{M) — <f) c (M) = 
4>{M). Once we have catalogs of 2MASS galaxies in dif- 
ferent environments we can go back and estimate the 
environment-dependent luminosity functions. Note that 
we only need the shape of the luminosity function for the 
clusters - all expressions using <j) c (M) will appear in the 
ratio 4> c (M)/& c (Mk*), which eliminates any dependence 



2.2. Simulations of 2MASS 
We generated a simulated 2MASS catalog based o n the 



approach described in detail in White & Kochanek ( 2001 
hereafter WK). We used as a starting point the z = 
output of a 256 3 particle N-body simulation of a ACDM 
'concordance' cosmology whose predicted mass function 
agrees well with recen t obse rvational estimates (e.g. Pier- 
paoli, Scott & White 200l[ ). Halos were identified using 
the fricnds-of-fricnds (FoP) algorithm with a linking length 
6 = 0.2 times the mean inter-particle spacing, and galaxies 



were 'assigned' to halos based on the multiplicity function. 
The average number of galaxies assigned to a cluster halo 
with a FoF mass Mp a p is 
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(4) 



galaxies where M R = 2.5 x 10 12 /i _1 M Q and M B — 
4.0 x 10 12 /i _1 M Q bas ed on the halo occupancy models 
of Scoccimarro et al. ( 2001 ) which we have modified to 
better match the observed number densities of galaxies 
and the expectation that 2MASS galaxies will trace the 
mass more faithfully than a blue selected sample. We 
sometimes used the virial mass M200 defined as the mass 
interior to a radius r2oo within which the mean density 
exceeds 200 times the critical density. The virial radius, 
which scales as r 20 o = 1.6(M 2 oo/i/10 15 M Q ) 1 / 3 /i- :L Mpc, is 
slightly smaller than the average radius of the FoF cluster 
with b — 0.2, so that on average M200 = 0.6QMp o p (me- 
dian M200 = 0.71Mp o p). For massive halos the number of 
galaxies actually assigned to a halo is Poisson distributed 
for an expectation value of Np Q p. Each halo hosts a 'cen- 
tral' galaxy which inherits the halo center of mass posi- 
tion and velocity. Any additional galaxies are randomly 
assigned the positions and velocities of the dark matter 
particles identified with the cluster by the FoF algorithm. 
In this way the satellite galaxies trace the density and ve- 
locity field of the dark matter. 

One problem with standard halo multiplicity functions 
is that they are one-dimensional functions, N(M), nor- 
malized to match a sp ecific sample (but see Yang, Mo 
& van den Bosch 2002 for improvements in this regard). 
Even for galaxy mass scales, current models of the halo 
multiplicity function lead to too few galaxies to match 
the luminosity or velo city f unctions of galaxie s over broad 
ranges (see Kochane k |2001 ; Scoccimarro et al. 2001 ; Chiu, 



Gnedin & Ostriker 2001 ). For general theoretical use, 
we need the luminosity-dependent multiplicity function, 
N(> L\M), for the average number of galaxies more lu- 
minous than L in a halo of mass M. For clusters, we 
might expect a halo multiplicity function to factor as 
iV*(M)$ c (> L)/$ c (> L») where $ c (> L) is the inte- 
grated cluster luminosity function and N*(M) is the mass- 
dependent number of galaxies more luminous than L». In 
the language of Yang et al. ( 2002| ) this assumption cor- 
responds to assuming a weak dependence of their $», 
and a on M in this range. 

With our modified multiplicity function, Eq. (Q), we 
can match the numbe r density of galaxies in the global 
Kochanek ct al. (2001) K-band luminosity function or the 
K-band galaxy number counts if we view these galaxies 
as a complete sample truncated at L min — 0.01L*. The 
average number of galaxies assigned to a cluster of mass 
M at redshift z in a catalog with magnitude limit K\i m is 
(N(M,z)) = Np p(M)C(z). The completeness function 
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(5) 



where L(z) = max(L m j n , Ln m (z)) for the limiting luminos- 



ity of 2.51og(Li im (z)/L* 
parameters, Lu m (z) = L„ 



M K , - (K nm - V[z)). For our 
n at redshift cz = 1350 km/s 
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and L\i m (z) — at cz — 14000 km/s. The average num- 



ber of galaxies with L > L* is 



N, 



*FoF 



(M) 



N FoF (M)T[l + a, 1] 
T[l + a, L min /L*] 



0.042AT FoF (M) 



(6) 

for our standard parameters. On average, clusters with 
Mh/M Q = 10 15 , 10 14 and 10 13 are assigned 500, 53 
and 6 galaxies of which 21, 2 and 0.2 are brighter than 
£*. Over the range we actually find clusters, this rela- 
tion is well fit by a power-law of the form log iV*666 = 
A + Blog(M 2 ooh/10 15 M Q ) with A = 1.32 and B = 0.98. 
We adopt this form to describe the halo occupancy func- 
tion and summarize all of our subsequent estimates for it 
in Table § 

The model catalog contained 100706 galaxies, whose 
clustering properties are close to those of the observed pop- 
ulation. Unlike WK, where we characterized the observed 
properties of the galaxy simply by the accuracy with which 
its redshift could be determined, we assigned K-band mag- 
nitudes to each galaxy by drawing randomly from the lumi- 
nosity function. This process oversimplifies several aspects 
of the real survey. First, we did not include the environ- 
ment dependence of the luminosity function. Second, for 
lower mass halos where there is essentially one galaxy per 
halo, we did not correlate the luminosity with the mass 
of the halo. Third, to mimic the redshift measurements 
in the 2MASS sample we assumed that all redshifts were 
measured for K< 11.25 mag and that one-third, randomly 
selected, were measured between 11.25 < K < 12.25. This 
mimics the statistics of the real data, but the redshift mea- 
surements in the real data are more likely to be of clus- 
ter galaxies than of field galaxies. Finally we ignored the 
|6| > 5° latitude cut made for the 2MASS data and simply 
analyzed the synthetic catalogs for the whole sky. Most of 
these simplifications should make it more difficult to iden- 
tify clusters in the synthetic catalog than in the real data. 
Indeed, the visual impression when comparing slices of the 
model survey and 2MASS data is that the "fingers of god" 
are weaker in the model. We intend to significantly im- 
prove our mock galaxy catalogs in the next iteration, using 
the relations we derive from the data in later sections (see 
§||) combined with a better underlying simulation. 

3. FINDING AND ANALYZING THE CLUSTERS 

Large cluster samples have been constructed using five 
general approaches. In the first method, used to con- 
struct the orig inal large catal ogs and thei r suc cessors 
(e.g. Abell |195S|; Shectman |1985|; Dalton et al. |1992|; Lums- 
den et al.T!§9£ 



Ostrander et al. 1998; Scoddeggio et 
al. |1999| ; Gladders & Yee [2000| ; White et al. |1999| ), clusters 
were selected as overdensities in the projected density of 
galaxies on the sky. It was quickly realized such surveys 
suffer from projection effects and there is a large scatter 
between optical richness and cluster mass (for recent theo- 
retical studies see e.g. va n Haa rlem, Frenk & White 1997; 
Reblinsky & Bartelmann |1999| WK). 

Recently, more quantitative versions of these methods 
based on matched filter algorithms to fi nd eit her the dis- 
cre te clu ster galaxie s (Po stman et al. 1996] ; Kepner et 



al. |1999| ; Kim et al. |200l| ) or the excess luminosity from 



unresolved gala xies (D alcanton 



1996 



Zaritsky et al. 



1997 



ters at intermediate redshifts. A host of approaches have 
recently b een a pplied to the data from the SDSS (see Bah- 
call et al. |2002 f or rec ent discussion and Goto et al. 2001 
and Nichol et al. 2000] for early work) 



Second, with the advent of large redshift surveys, clus- 
ter catalogs were constructed by finding three-dimensional 
overdensities in the galaxy distribution using the fri ends 



of-friends (FoF) a lgorit hm (e.g. Huchra fc G eller 1982 
Geller & Huc hra |1983| ; Rame lla et al. |1994|; Ramella 



Pisani & Geller |l997| Christlein |2000| , Ramella et al. |2002|). 
Studies of the FoF al gorith m using N -body simulations 
(Nolthenius & White |1987| ; Frederic |1995| ; Diaferio et 



al. 1999) demonstrate the FoF algorithm works well for 



appropriate choices of the linking param eters. The tree or 
"dendogram" methods (e.g. Tully 1987) are related to FoF 
methods. 

The third approach to bu ilding cluster cata logs i s to use 



X-ray sur veys ( Gioia et al. 199C; Edge et al. 1990; Henry 



& Arnaud 1991: Rosati et al. 1995; Jones et al. 1998; Ebel- 



ing et a]Jl99§; Vikhlinin et al. J199&1 ; Romer et al. pOOO 
Henry gOOfj; Blanchard et al. p000| ^ Scharf et al . P000 



Bohri nger e t al. [200l| ; Ebeling, Edge & Henry |200l| ; Gioia 



et al. 2001). X-ray surveys avoidg many of the problems 
in selection and characterization which have made optical 
catalogs difficult to use, but frequently lack the sensitivity 
to detect and characterize groups or clusters at z ^> 0.5. 
Finally, two rapidly developing approaches are to use sur- 
vey s bas ed on the Sunyaev-Zel'dovich effect (Carlstrom et 
al. |2000|_ or on weak gravi tational lensin g (Sc hneider & 
Kncib H998| ; Wu et al. |199& H attori et al. |1999| ; Wittman 



beke Olackey 2002) 



et al. 2001; Moller et al. 2001; but see White, van Waer- 



Gonzalez et al. 2001) have been developed to find clus- 



3.1. The matched filter algorithm 

We identify groups and clusters in the 2MASS catalog 
using the modified optimal filter method we developed in 
WK, which is i tself d erived from the similar algo rithms of 
Kepner et al. ( |1999| ) and Postman et al. fll996| ). These 
methods identify clusters as 2- or 2.5-dimcnsional over- 
densities in three-dimensional redshift surveys. This is a 
departure from the essentially uniform use of the FoF algo- 
rithm to find clusters in redshift surveys. While the FoF al- 
gorithm clearly works to identify groups and clusters with 
very small numbers of galaxies, a matched filter approach 
provides several advantages. Matched filters provide like- 
lihood estimates for the detection, membership probabil- 
ities for individual galaxies, a range of cluster properties 
and their uncertainties and (importantly for us) can natu- 
rally incorporate galaxies both with and without redshift 
measurements . 

In WK we used N-body simulations to explore the selec- 
tion of clusters in a series of model surveys for clusters at 
intermediate redshift. In particular, we developed strate- 
gies for selecting samples above a fixed mass threshold over 
a range of redshifts, showing that it was relatively easy to 
obtain highly complete samples at the price of significant 
contamination from real but somewhat less massive clus- 
ters. Contamination by detections of non-existent clusters 
was rare. 

The algorithm works as follows. We model the universe 
as a uniform background and a distribution of k = 1 • • • n c 

2 But see Lewis et al. (2001). 
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clusters. Cluster k is described by its angular position 
9 k , (proper) scale length, r ck , a fixed concentration, c, 
redshift Zk and galaxy number N k . To ease comparisons to 
theoretical models we describe the gala xy de nsity profile 
by an NFW (Navarro, Frenk & White |1996| ) profile. In 
three dimensions, the galaxy distribution normalized by 
the number of galaxies inside an outer radius r out = cr c is 



p(r) 



Airr^F{c) x(l + xf 



(7) 



where x = r/r c , F(x) = ln(l + x) — x/(l + x) and 
47r J Q r 2 drp = 1. The relative number of galaxies in- 
side radius r is N(< r) — F(r/r c )/F(c). The projected 
surface density is more complicated, 



f(R/r c ) 
2nr 2 F(c) 



where 
fix) 



x 2 -l 



1|V: 



-tann 



x - 1 


1/2" 


x + 1 





(8) 



(9) 



and tann 1 (x) = tan 1 (x) (tanh 1 (x)) for a; > 1 (x < 1). 
The relative number of galaxies projected inside cylindrical 
radius R is N(< R) = g{R/r c )/F(c) where 



g(x) = ln(x/2) + 



1|V2 



tann 



x- 1 



1 



1/2 



(10) 



(Bartclmann |1996 ). The projected quantities are consid- 
erably more complicated than the simple model (X oc 
(1 + x)~ 2 ) we used in WK, but the earlier model corre- 
sponds to a three-dimensional density which is too compli- 
cated for convenient use. We fixed the halo concentration 
to c — 4 and the break radius to r c = 200ft -1 kpc, as typ- 
ical parameters for cluster-mass halos. We estimate the 
probability a survey galaxy has its observed properties as 
either a field or a cluster member galaxy. These properties 
can include the flux, redshift, angular position and color, 
with a basic distinction being made between galaxies with 
redshifts and those without. 

3.2. Probabilities for Field Galaxies 

The probability of finding a field galaxy of magnitude 
Ki and redshift Zi is 

P f (i) = 0.41nl0 DUzJ^MKi ~ Z>(*0) (H) 

where Dq is the comoving distance. The probability of 
finding a field galaxy of magnitude Ki is 



(12) 



where dN/dm = dzPf(K, z) are the differential num- 
ber counts. 



3.3. Probabilities for Cluster Galaxies 

The probability of finding a cluster galaxy of magnitude 
Ki and redshift Zi at an angular separation Oik from a clus- 
ter at redshift z c k with (proper) scale length r c k, galaxy 
number N* ck , and rest frame velocity dispersion a ck is 

P /• I \ AT - V ( Z ck)) ^ ( DA{Z ck )0 tk 

®c(M Klf ) \ r ck 



exp(-c 2 (zi - z ck ) 2 /2a 2 k (l + z ck ) 2 ) 



2ira ck {l + z ck ) 



(13) 



The number of galaxies N* ck is normalized to represent 
the number of galaxies brighter than L» inside the ra- 
dius r out = cr c . This normalization is convenient for 
calculation, and we discuss other normalizations in §3.6 



The performance of the algorithm in correctly determin- 
ing galaxy membership and cluster velocity dispersions is 
significantly enhanced if we optimize the width of the ve- 
locity filter for each cluster candidate. We used the range 
150km s -1 < a ck < 1200km s~ . For a galaxy with an 
unknown redshift, the likelihood is 



Pf - U \-AT ~ ^CggfcH V ( D A(Zck)0tk 

Pc{h k) - N * ck MM K ,) ^ 1 r ck 



(14) 



We calculate the likelihood in \h~ l Mpc regions around 
each galaxy based on either the measured redshift or the 
average redshift for galaxies with the observed magnitude. 
The search area was limited to A9 = 4°. The fraction of 
cluster galaxies inside search radius R = DA(z ck )A9 of a 
cluster at redshift z ck is 



A k (A9) 



g{R/r c ) ScOKita, - V{z k )) 



F(c) $ c (M ft ) 
3.4. Likelihood Function 



(15) 



The likelihood function for finding the first cluster n c 
k = 1 is 



Aln£(fc) = -N* ck A h 



£ 

3=1 



hi 



Pf{j)+N, ck P ck {j,k) 

PfU) 



(16) 

where the sum extends over all galaxies within angle A9 of 
the cluster. We compute the likelihood at the positions of 
each survey galaxy over a range of trial redshifts, optimiz- 
ing the number of galaxies N* ck for each trial. We then add 
the trial cluster which produced the largest change in the 
likelihood to the cluster sample. We automate the process 
and avoid finding the same cluster by including each new 
cluster in the density model used to find the next cluster, 



Aln£(n c ) 



The clusters k = 1 • • • n c — 1 arc our current catalog and 
have fixed properties, while k — n c is our new trial cluster. 
By this method we "clean" the clusters out of the sample 
in order of their likelihood (for further discussion see WK). 

We found that several small modifications improved the 
performance of the algorithm. In most cases these are pri- 
ors on the filter variables iV„ c and a ck designed to stabilize 
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parameter estimation for systems with very fe w g alaxies. 
First, we added a prior to the likelihood, Eq. (|l7|), of the 
form — hi(N§ + N% c ) with Nq = 0.1 comparable to the 
value of N* c expected for a very poor group. This en- 
codes the information that massive clusters are rare, with 
a slope roughly matching the power-law slope of the halo 
mass function. In some senses the prior acts to control 
the problem of Malmquist biases created by combining a 
steep number function dn/dN* c with the uncertainties in 
estimates of N* c . Second, when optimizing the likelihood 
over the width of the velocity filter o~ Cl we included a prior 
on the likelihood of the form \n(a c /ao) with the arbitrary 
normalization of <7q — 10 3 km/s. This prior makes the like- 
lihood independent of o~ c when a candidate cluster contains 
only one galaxy with a redshift measurement^. Third, we 
added a weak prior on the relation between N* c and o~ c 
based on our initial results. Preliminary, consistent mod- 
els found that log A* c = 1.13 + 1.90 log(<r c /1000km/s) with 
a dispersion of 0.39 dex. After determining AT» C as a func- 
tion of o~ c on a coarse grid, the likelihoods were weighted 
by this log-normal prior. The prior is only important for 
setting the filter width a c in systems with too few galax- 
ies for a direct estimate of the velocity dispersion. It also 
helps to suppress the superposition of clusters into a single 
larger cluster because superpositions have large apparent 
velocity dispersions for the number of galaxies. Fourth, for 
the trial galaxy used to compute the likelihood we used the 
mean surface density inside the break radius r c rather than 
the central density of the profile in computing the likeli- 
hood. This reduces a bias towards finding clusters centered 
on isolated galaxies given our cusped density profile. 

3.5. Matching to Existing Catalogs 

Once we have identified a cluster candidate we want to 
compare its properties either to the known properties of 
our synthetic catalog or to other properties of the real 
clusters. We cross reference our cluster catalog to either 
published or the synthetic cluster catalogs both through 
the member galaxies and by matching the cluster coor- 
dinates and redshifts, separately tracking both identifica- 
tions. Each galaxy is associated with its most probable 
cluster, the cluster which maximizes its membership like- 
lihood, and has a likelihood ratio Si between the probabil- 
ity that it is a member of that most probable cluster and 
the probability that it is a field galaxy. This likelihood 
ratio defines the probability that the galaxy is a cluster 
member, pi = <5j/(l + Si). 

In matching our new catalog to existing catalogs we re- 
quire algorithms which are fully automated and robust. 
They will not be perfect, but we cannot afford to conduct 
laborious individual case studies in a broad survey. For 
our synthetic data, each galaxy is labeled by its parent 
cluster, allowing us to match the output and input cata- 
logs by identifying the most common parent cluster for all 
the galaxies with membership probability pi > 0.5 (Si > 1) 
in each output catalog. This means of identification is ro- 
bust, with relatively few multiple identifications or false 

3 For a maximum likelihood estimate of the velocity dispersion 
from a set of velocity measurements Vi , adding this prior leads to an 
unbiased estimator of the velocity dispersion. If — In C = N In a + 
J2i( v i -»>) 2 /( 2ct2 ) adding a In cr prior and solving for a leads to the 
unbiased estimator a 2 = (N — — v) 2 . Without the prior 

the estimator is biased by N/(N — 1). 



identifications. 

For the real data, the inhomogeneity of local cluster 
identification enormously complicates cross references be- 
tween our cluster identifications and previous studies. The 
primary problem is that cross referencing is best done by 
cross referencing the proposed member galaxies, but few 
local group/cluster surveys supply such data. We settled 
on an ecumenical approach using the NASA Extragalac- 
tic Database (NED) to identify the galaxies likely to be 
associated with each known cluster and then matching 
our cluster catalog to the existing catalogs based on the 
member galaxies. We searched NED for all "clusters" (ob- 
ject types GClstr, GGroup, GTrpl & GPair). From this 
list of 31130 clusters we looked for overlapping identifica- 
tions which were not recognized by NED. This required 
some iteration, and was a hopeless task at low redshift 
(near Virgo) where a nearly infinite number of "groups" 
have been found, many of which appear to be substruc- 
tures of a larger object. Based on these identifications we 
assigned cluster identifications to galaxies within a pro- 
jected radius 0.5 ft. -1 Mpc and a redshift difference less 
than 1000 km/s. We then attempted to match our cluster 
detections to these galaxies using both the match to the 
cataloged cluster position and redshift and the most com- 
mon cluster names associated with the galaxies we assign 
to our detection. The final cross-matching system works 
reasonably well but is by no means perfect. 

3.6. Mass Estimates 

In addition to identifying the parent cluster and the 
member galaxies we would also like to estimate the mass 
of the cluster. We have four different ways of making the 
comparison. First, for the synthetic catalogs we have di- 
rect estimates of the masses. Second, for the real data 
we have measurements of X-ray luminosities and temper- 
atures which may serve as surrogate estimates of the mass. 
Third, we have the characteristic number of galaxies in the 
clusters. Fourth, we have estimates of the galaxy velocity 
dispersion in the clusters. 

Because clusters lack sharp edges, we must exercise 
some c are in defin ing and comparing our mass estimates 
(White 2001 , 2002 ). In fact, we found that failure to prop- 
erly match mass definitions affects both the normalization 
and the slope of our estimates of the halo multiplicity func- 
tion. For this reason we give explicit details of our proce- 
dure below. 

We use two standard mass estimates for the clusters in 
our synthetic catalog. The friends-of- friends mass, Mp F, 
is the total mass of the N-body particles linked into a 
group using the FoF algorithm with a linking length of 
b = 0.2 in units of the mean inter-particle spacing. We as- 
sign galaxies to the clusters based on the FoF mass (Eq. |^) 
because it can be computed quickly while we are build- 
ing the synthetic galaxy catalogs. We also calculate sep- 
arately the mass M200 inside the radius rjv/200 for which 
the enclosed density is Am = 200 times the critical den- 
sity. When we compare these two estimates we find that 
M200 — 0.66Mp O F because the FoF mass corresponds to a 
lower density threshold of Am — ^m/& 3 ~ 40. 

For the real data, where we do not measure the masses 
directly, we will compare our estimates from the galaxies 
to X-ray luminosities and temperatures. We used compi- 
lations of X-ray luminosities from Bohringer et al. (2000), 
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d200lH 



Cruddace et al 
De Grandi et al. 
Reiprich & Bohringer 
ray temp eratures 



Ebeling et al. (|1996| , |1998L g000|) , 
P)00|), M ahdavi et al. ( p000| ), and 
( 2002 ), and c ompi lations of X- 
from David et al. (1993), Ponman et 



al. ( |1996| ), Markevitch ( [1998D , Hclsdo n & Ponman ( |2000[ ) 
Ikebe et al. $002\ ) and White fl2000p . Most of the X-ray 
luminosity data is drawn from the ROSAT All-Sky Sur- 

19991 ) 



vey (RASS, Voges et al. 



We rescaled the X-ray 



luminosities and fluxes to a common scale by allowi ng for 
logarithmic offsets relative to Reiprich & Bohringer ( 2002 ) 
for each survey, with the offsets determined from clusters 
with measurements in multiple surveys. The offsets we 
applied are listed in Table 0. Measure d sca ling relations 
for clusters of galaxies are (Markevitch |f 998) 



log 



(l0 44 ergs/t 



(().ir>±iu)4H(2.1i)±ii.24il<>*( 

( 6 18) 



and (Horner, Mushotsky & Scharf f 999| ; Nevalainen, Markc 



vitch & Forman 2000 ; Finoguenov, Reiprich & Bohringer 2001 
Xu, Jin & Wu fjOOlf 



M A ~ 10 15 /i _1 M G 



1.3keV/ 



3/2 



A M E 2 



-1/2 



(19) 



where Am is the density threshold used to estimate the 
mass Ma, and E(z) = H(z)/H = 1 for our low redshift 
sample. The normalization of this last relation is currently 
quite uncertain, at the 30-50% level, due to the notori- 
ous difficulties inherent in measur ing th e mass of a cluster 
(e.g. Evrard, Metzler & Navarro 1996|) and differences in 
methods for fitting temperatures to t he X- ray emitting 
ICM. (See Fig. 2 of Huterer & White |2002| for a compi- 
lation of theoretical and observational results.) Different 
authors use different methods to model the 'temperature' 
of the plasma, different methods of estimating the mass 
and indeed even different definitions of the mass! To cloud 
the picture even further, numerical simulations based on 
purely adiabatic gas physics find a different normalization 
of this relation than most of the observations (although 
it must be noted that significant progress i n thi s regard 
ha s occu rred recently, e.g. Muanwong et al. 2002| , Voit et 



al. 2002) 



The number of member galaxies or their aggregate lu- 
minosity must be carefully defined before we can compare 
it to other results or theoretical models. Our fit parameter 
N* c is defined to be the number of L > galaxies inside 
the fixed, spher ical outer ra dius r out = cr c = 0.8 /t" 1 M pc. 



Postman et al. ( 1996 , 2002 ) and Donahue et al. ( 2001 ) use 
the luminosity, measured in units of L*, rather than the 
number. We prefer the number of galaxies, JV*, to this 
equivalent luminosity, A*, because it has a closer relation 
to the Poisson statistics which determine the detectability 
of clusters and may depend less on the defining luminosity 
function or wavelength. For the same normalizing aper- 
ture, the two quantities are related by 



A*L* = A*r[2 + a]/T[l + a, 1] = 5.0iV,L* 



(20) 



for a = — 1 .09. We cannot precisely match the Postman et 
al. (1996) definition of A c ; because we use different filters 
and luminosity functions. To the extent that A c / is defined 
by a r c i = Mpc (1.5h? 5 Mpc) radius aperture, we 

expect A* c/ ~ (g{r c i/r c )/ F(c))N* c = 1.6iV, c . 



While N* c and A c / are natural variables for finding clus- 
ters, they are poor choices for comparing to theoretical 
models. For comparisons to theoretical models we will use 
the number of L > L* galaxies A*a inside the spherical 
radius r^vA such that the enclosed galaxy density is Ajv 
times the average density of galaxies. Given a cluster with 
N« c galaxies brighter than inside r out we determine r& 
and A*a by solving the defining relations 



N, 



*A 



4-7T 

N* c F(r NA /r c )/F(c) = —n 



,Ar 3 NA T[l + a,l]. 



For reasons of computational efficiency and stability we 
keep c fixed when we determine tjva- In most theoretical 
models, the distribution of galaxies basically trace s the 
distribution of mass (Katz, Hern quist 
Gardner et aljl999 



Weinberg 1999 



1999; White, Hernquist 



Pearce et al. 

& Springel ^POlF However, if we assume the galaxies are 
unbiasedQ to isolate the same physical region we must set 
Ajv = Am / Q,m because the galaxy overdensity is defined 
relative to the average galaxy density while the mass over- 
density is defined relative to the critical density rather 
than the mean mass density. Thus, taking f2j\/ = 0.3, 
the number of galaxies brighter than inside the re- 
gion with the standard mass overdensity of Am = 200 

IS A» 666 = OMN^Fop with r A/2 00 = ?"jV666- 

Our best theoretical estimate of the conve rsion to Post- 
man et al. 



(22) 



[L99§ |2002|) and Donahue et al. (|200lD is 



11 A. 



3/4 
*666 



for A c i ^ fO. The slope difference is created by the dif- 
ference between a fixed r c i — l.bh^ Mpc aperture for 
A c i and a variable aperture rjv666 f° r A*666- In prac- 
tice, both the normalization and slope could differ from 
this estimate because of differences in the luminosity func- 
tions, wavelengths (K band versus 1 band) and matched 
filter structures. A simple check is to com pare A r i and 
A* 66 6 values for clusters of different Abell ( |l958) cl uster 
richness classes. Our overlap with the Abell (1958) cat- 
alog is limited because the average redshift of the Abell 
clusters is significantly higher than our 2MASS catalog. 
A quick match based on NED cluster names found 31, 
39, and 15 Abell richness class 0, f and 2 clusters with 
fractional errors in Af^e smaller than 25% in the cata- 
logs of §[s|. We find an average richness and dispersion of 
( A» 666 ) = 5.3 ± 2.7, 8.9 ± 4.3 and 13.4 ± 6.4 for Abell rich- 
ness classes of 0, f and 2 respectively. The ratios of the av- 
erage A*666 values roughly track the ratios of the number 
of gala xies a ssociated with each richness class. Postman 
et al. (|2002|) estimate that A c; = 50, 80 and 130 should 
correspond to the typical Abell richness classes 0, 1 and 
2 clusters which differs significantly from our estimated 
conversion to A c/ (Eq. ||) of A cl = 41 ± 16, 60 ± 22, and 
84 ± 28 for the same richness classes. While using Abell 
richness classes to compare modern algorithms can hardly 
be recommended, the comparison suggests that we should 
explore empirical conversions between A c ; and A*666- 

We find clusters with richnesses as low as A*666 — 0.1 
in our catalogs, and this requires some explanation. First, 

4 The 'galaxies' in the simulated catalog are very nearly unbiased. 
The bias of the 2MASS galaxies is currently unknown, but we shall 
argue later it is likely they are relatively unbiased. 
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Table 1 

Adopted Offsets Between X-ray Data 



Name 


Offset 


Error 


-^cluster 


BCS 


0.036 


0.005 


351 


de Grandi 


0.246 


0.009 


145 


Cruddace 


0.039 


0.008 


109 


Ponman 


-0.084 


0.035 


9 


RASSCALS 


0.067 


0.008 


55 


HIFLUGCS 


= 


= 


193 


XBACS 


0.040 


0.004 


377 


NORAS 


0.051 


0.004 


299 



Note. — The offsets, defined by the loga- 
rithmic correction needed to match ljm fluxes 
quoted by Reiprich & Bohringer (2002), ap- 
plied to the X-ray luminosities from trie dif- 
ferent samples we analyzed. The offsets were 
generally small, with only that a pplied to the 
sample of de Grandi et al. (2000) being signif- 
icant. 



Table 2 

Estimates of Halo Occupancy Function 



Data 



Mass Scale 



Correlation 



Transformations zero-point A slope B Comments 



Synthetic True 

M 200 r-j A r *666(M 20 o) 

0-200(^200), Eq. g£| N*666(a) 

dn/dM200 dn/dN*666 



1.32 
1.25 ± 0.03 
1.23 ±0.04 
1.43 ± 0.05 
1.37 ±0.05 
1.43 ± 0.09 



0.98 
0.89 ±0.02 
0.94 ±0.03 
1.13 ±0.04 
1.04 ±0.04 
1.09 ±0.08 



2MASS CT200 (M200), Eq. 



iV.666 (T x ) 

iV*666(ix) 
Average 

W, 666 (<7) 

AT*666(Tx) 

N*66&(Lx) 

Average 
dn/dNteee 



Adopted Standard 



Nthresh = 3 
Nthresh = 5 
Nthresh = 10 



T X (M 200 ), Eq. [| 



dn/dM^oo 



a(T x ), Eq.[| 
a(L x ), Eq. 
a(T x ), Eq.|| 



Lx(T x ), Eqs. 
Lx(T x ), Eqs. 



1.28 ±0.03 

1.24 ±0.05 

1.25 ±0.05 
1.22 ±0.04 
1.25 ±0.03 

1.57 ±0.04 
1.60 ± 0.07 
1.59 ±0.06 
1.56 ±0.06 
1.56 ±0.06 

1.58 ±0.02 
1.49 ± 0.04 
1.49 ± 0.04 
1.54 ±0.05 
1.44 ±0.17 



0.90 ±0.02 
1.13 ±0.09 
1.06 ±0.06 
0.98 ±0.06 
1.02 ±0.10 
1.10 ±0.02 
1.39 ±0.11 
1.31 ±0.08 
1.05 ±0.07 
1.19 ±0.08 
1.21 ±0.14 
1.13 ±0.03 
1.08 ±0.03 
1.13 ±0.06 
1.10 ±0.09 



Bright X 
All X 
Bright X 



Bright X 
All X 
Bright X 
Bright X 

Nthresh = 3 
Nthresh = 5 
Nthresh = 10 



Note. — The halo occupancy function has the form logAf,666 = A + Blog(Al2oah/10 15 Mq). Except for the fits to the 
cluster number function, all results are for a standard cluster sample consisting of the systems with at least N v > 5 
associated redshifts. The cluster number function estimates are for threshold galaxy numbers of Afresh = 3, 5 and 10. 
The halo mass function dn/dM2oo is that of Jenkins et al. ( [2001 ) converted to M200 from the original FoF masses assuming 
M200 = OSGMpoF as found in our simulations for the same FoF linking length. The N-body simulations used to generate 
our synthetic catalog have the same initial parameters. Models based on X-ray data samples limited by Tx > 1 keV and 
Lx ^ W 42 h 2 ergs/s contain the comment "Bright X". The Average entries are simply the average coefficients and their 
scatter. The Adopted standard combines the 2 Average relations and the N t f lrea h = 5 result for the 2MASS data. 
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low values of -ZV»666 only imply that it is unlikely the clus- 
ter contains any bright (L > L*) galaxies. It does not 
imply the cluster is unlikely to contain galaxies. For ex- 
ample, an -/V*g 66 = 0.3 cluster should contain 3 (7) galaxies 
with L > L*/10 (L > L*/100) inside its virial radius. We 
can routinely detect such systems provided the redshift is 
low enough for us to detect the lower luminosity galaxies. 
Second, we detect the systems in projection, and the num- 
ber of galaxies projected inside the virial radius is larger 
than the number inside the spherical virial radius. This is 
a modest effect for the NFW profile, but real clusters are 
embedded in more extended infall regions which boost pro- 
jection effects. Third, there are significant errors in these 
low estimates of iV*666 because there are so few galaxies. 
Thus, these low richness clusters can be real virialized sys- 
tems, but they must be treated with care. 

Finally, we have estimates of the velocity dispersion of 
the clusters. Although we optimized the cluster velocity 
width a c k as part of our matched filter, we do so at low 
resolution (150 km/s) and it includes contributions from 
galaxies at large distances from the cluster center. To bet- 
ter mimic a virial velocity for a cluster we use a member- 
ship probability weighted estimate of the velocity disper- 
sion. We take all galaxies with measured redshifts, mem- 
bership probabilities pi > 0.5 and projected separations 
less than the model outer radius R < cr c — 0.8/i _1 Mpc 
and estimate a mean redshift 



and a velocity dispersion 



(I» 2 Uczi-(cz)) 2 Pi 



(23) 



(24) 



which we then correct to the rest frame cr = <Jo/(l + z). 
The leading term plays the role of the familiar N / (N — 1) 
factor needed to make the variance of the velocities about 
the mean an unbiased estimator of the velocity dispersion. 
The weighting by membership probability plays an equiva- 
lent role to the clipping of outliers in standard approaches 
to estimating cl uster velocity dispersions (see discussion 



m Borgani et al. |l999|) . and the method is similar to that 



used by Barmby & Huchra ( |l998 ) 



3.7. The Cluster Number Function 

We can also estimate the cluster number function, dn/dN, 
which is the distribution of clusters with respect to the 
number of member galaxies. If the number of member 
galaxies, N(M), is a simple function of the cluster mass 
then the number function is simply related to the cluster 
mass function dn/dM by 



dn 
dN 



dn 
dM 



dM 



dN 



(25) 



If we assume that the cluster mass function is known, then 
we can infer the halo occupancy function by determining 
the function N(M) needed to transform the assumed mass 
function into t he ob served number function. We use the 
Jenkins et al. ( [200 1| ) CDM mass function, which should 
closely match that of our N-body simulation since it is 
based on numerical simula tions with similar cosmologi- 
cal parameters (see White 2002 for further discussion). 




100 

Aln^(O) 



Fig. 1. — Completeness as a function of Aln£(0) for clusters 
with M cut > 10 1 */i -1 M Q and cz < 10000 km/s. The solid line 
shows the completeness, the dashed line shows the fraction of clus- 
ter candidates which are false positives. The upper dotted line (at 
Aln£(0) = 100) shows the fraction of cluster candidates which are 
real clusters with masses M cu t/3 < M < M cu t. The lower dotted 
line (at Aln£(0) = 100) shows the fraction of cluster candidates 
which cannot be identified with any halo. 



- MS10 14 M o 



O 
In- 



completeness 



3xl0 13 M m gM<10 14 M„ 



unmatched 



Aln40) = 150 



false pos. 



cz (km/s) 



Fig. 2. — Completeness as a function of redshift for clusters 
with M cut > 10 14 /i _1 M© and Aln£(0) = 150. The solid his- 
togram shows the fraction of the M > M cu t clusters detected in 
each bin, the dashed histogram the fraction of candidates which 
are false positives. The dotted histogram, decreasing with redshift, 
shows the fraction of cluster candidates which are real clusters with 
-Mc U t/3 < M < A/cut, while the rising dotted histogram shows the 
fraction of candidates which cannot be identified with any halo. 
The vertical lines mark the redshifts where the average M = M cu t 
cluster has 10 or 3 galaxies. 
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The Jenkins et al. ( 2001 ) mass function was derived us- 
ing Mp p masses for the clusters, so we transformed it 
from dn/dMpop to dn/dM200 assuming the average rela- 
tion that M200 = 0.66Mp o p we observe in our simulations 
for the same linking length. This meth od is s imilar to the 
approach used by Berlind & Weinberg ( 2002| ) and Berlind 
et al. ( |2002| ). 

We can estimate the nu mber function using the V/V max 
method (e.g. Moore et al. 1993 ) as follows. At rcdshift z, 
a cluster normalized by N*qqq contains 

Ng(z) = AU 66 r[l + a, L lim (z)/L*]/r[l + a, 1] (26) 

galaxies, where L\[ m (z) is the luminosity corresponding to 
the survey magnitude limit at redshift z. Suppose we de- 
tect all clusters with at least N g > iV t hresh galaxies. Then 
we detect clusters in the volume Vmax(-/V*666) defined by 
the comoving survey volume out to the redshift zn m which 
solves iVthresh = N g (z]i m ). The integral number function 
is then 



n(> N) = 



E 

Af>N 



v max (Afy 



(27) 



where the sum includes only clusters with more than the 
threshold galaxy number at their observed redshift. For 
large values of iVthresh we are less sensitive to the Poisson 
errors in estimating iV*666 an< i minimize the risk of includ- 
ing false positives as clusters. This comes at the price of 
increased Poisson errors in the number function, because 
we include fewer clusters, and increasing sample variance 
because we include less volume. We limited the sample 
used and V max to cz < 15000 km/s and used bootstrap 
resampling of the catalog to estimate the uncertainties in 
the number function. 



and cz < 10 km/s, we miss only 5 at our likelihood cut- 
off. Of these, one was merged into another cluster, 4 
were systems with M ~ 10 4 /i _1 M0 and fewer galaxies 
than expected given their mass. We can achieve very high 
completeness out to the redshift where the typical cluster 
contains only 3 galaxies. However, all samples with high 
completeness also have high false positive fractions. If the 
likelihood threshold is kept high enough, the vast majority 
of these false positives are real clusters with masses just 
below the cutoff mass scale - very few of these false pos- 
itives lack identification with any M > 3 x 10 13 /i" 1 M Q 
halo. For lower likelihood thresholds the mass range of 
the false positives broadens and then slowly becomes dom- 
inated by unmatched systems where we could not identify 
the detection with a cluster in the input catalog. 

Fig. H shows the redshift dependence of the complete- 
ness and false positive rates after choosing Aln£(0) to 
have a 50% false positive rate. The completeness declines 
slowly with redshift, but remains close to 90% complete 
from < cz < 10000 km/s. The false positive rate aver- 
ages 50%, and the false positives consist almost entirely 
of real clusters with masses between 3 x 1O 13 /i _1 M and 
10 14 /i _1 M Q . False positives that correspond to no clus- 
ter become important only for redshifts beyond which a 
10 14 /i _1 M Q cluster contains fewer than 3 galaxies. Within 
our cz < 10000 km/s selection window there is a total vol- 
ume of 4.1V where V = (IOO//1) 3 Mpc 3 , containing 101 
clusters more massive than 10 14 /j -1 Mq. Using these clus- 
ters to estimate the mass function would be limited by the 
correction for the contamination by lower mass clusters 
rather than the correction for completeness or statistical 
uncertainties including sample variance. 

The true completeness of the recovered cluster sample 



4. TESTS WITH THE SYNTHETIC CATALOG 

We start by testing the algorithm and our recovery of 
the halo occupancy function in the synthetic data designed 
to mimic the real 2MASS galaxy sample. 

4.1. Completeness and contamination 

We first consider the completeness of the resulting cata- 
logs. As we discuss in WK, the likelihood associated with a 
cluster roughly scales with the number of member galax- 
ies, so the distance at which we can detect clusters can 
be roughly estimated from the number of galaxies. In 
our models, the average number of cluster galaxies drops 
to N FoF = 10 (3) at redshifts of cz = 12000 (17000), 
7400 (13000), and 2700 (7500) km/s for clusters of mass 
Mh/M Q = 3 x 10 14 , 10 14 and 3 x 10 13 respectively The 
spread in the likelihoods for a fixed mass and redshift will 
be considerable because of the Poisson variations in the 
number of galaxies bright enough to be visible and be- 
cause of the random variations in the numbers of measured 
redshifts for the fainter galaxies. In WK we showed that 
the cluster likelihood essentially scales with the number 
of member galaxies, so that by scaling the selection likeli- 
hood as Aln£(z) = C(z)Aln£(0) we can extract clusters 
with a nearly constant mass threshold as a function of red- 
shift. The choice of the zero-redshift likelihood threshold, 
Aln£(0), is related to the mass cut and its completeness. 

Fig. pi illustrates this for model clusters with M > 
10 14 /i-% Q . Of the 101 clusters with M > 1O 14 /i^ 1 M 




|QU JQ13 10 " 10" 



^200 ( M h ) 

Fig. 3. — The number of galaxies ^666 as a function of the virial 
mass M2oo- The points show all detected clusters with at least 
N v > 5 associated redshifts which have been matched to the input 
cluster catalog. The heavy solid line shows the true relationship and 
the light dashed lines show the estimated relation and its width as 
estimated from the variance of the points. 
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is higher than we have quoted because of the way we have 
generated the synthetic catalog. In regions of high den- 
sity, where we expect the most massive clusters, the FoF 
algorithm with a linking length of 0.2 is prone to merg- 
ing sub-clumps into larger objects. Our search algorithm 
instead detects the individual sub-clumps with lower like- 
lihoods than would be expected for an object as massive 
as the FoF cluster. The sub-clumps may have too low a 
likelihood to be included (creating a false impression of in- 
completeness) or multiple clumps may be included (adding 
to the false positives). Most peculiar high FoF mass but 
low detection likelihood systems could be traced to this 
feature of the FoF algorithm. 

4.2. Halo occupancy function 

Given a well-defined cluster catalog, the next step is to 
examine how well the cluster parameters agree with the 
true properties of the clusters. We first test our ability to 
recover the true halo occupancy function N(M) given our 
knowledge of the halo masses for the synthetic catalogs. 
For the real 2MASS data we will consider cluster X-ray lu- 
minosities and temperatures or galaxy velocity dispersions 
as surrogate, but independent, estimates for the mass. 

We test our ability to recover the halo occupancy func- 
tion by comparing the number of galaxies iV*666 inside the 
An = 666 contrast level. This corresponds to the num- 
ber of galaxies inside a mass contrast level of Am = 200 
and should, on average, match 0.66N*p o p. There are two 
critical issues to estimates of the halo occupancy function. 

First, definitions are critical! The "number of galax- 
ies in a cluster" must be precisely and similarly defined 
both for the algorithm used to find the clusters and the 



theoretical model used to interpret the results. Failure to 
match the definitions, for example by using N* c instead of 
iV»666, makes it impossible test the algorithm (for our syn- 
thetic catalog) or to interpret the results (for the 2MASS 
catalog). In particular, N rc > A» 6 66 f° r l° w mass clus- 
ters where J"M2Q0 < cr c — OSh^ 1 Mpc and iV* c < A»666 
for high mass clusters where rM200 > cr c — 0.8/i _1 Mpc. 
Thus, the slope of N* C (M) ~ M - 75 is significantly flatter 
than the slope of A»666 ~ M. We expect the same flatten- 
ing of the sl o pe w ill be found for A c ; as used by Postman 
et al. ( |1996| , |2002| ) and Donahue et al. fl200l| ) for similar 
reasons. 

Second, in estimating relations like N(M) it is impor- 
tant to select subsamples which minimize the introduction 
of mass and number-dependent biases. For example, esti- 
mating N(M) using a sub-sample selected such that the 
standard error in N is less than a threshold or that the 
number of galaxies in the cluster exceeds some threshold 
will preferentially include low mass clusters with high val- 
ues of N over those with low values of N. This biases 
estimates of N(M) to have flatter slopes than the true 
relation. 

With the logarithmic prior for A* c included in the like- 
lihood, our ability to estimate the halo occupancy be- 
comes very robust. Simple rcdshift cuts work well, but 
we will define our standard sample by systems which con- 
tained at least N v > 5 galaxies with redshifts. After 
selecting a cluster sample we fit the distribution as a 
power law including the formal uncertainties in the es- 
timate of A» 666 for each cluster. The results, illustrated 
in Fig. ||, are remarkably good. Above 10 13 ft. _1 M© the 
input relation is well approximated by a power law (see 
Table |), logA» 666 = A + Blog(M 2OO /i/lO 15 M ) with 




100 1000 



CT 200 (km/s) 

Fig. 4. — Comparison of the velocity dispersion <r estimated for 
the cluster candidate to the velocity dispersion 0-200 of the dark 
matter particles inside r200- Cluster candidates including at least 
N v > 10 (5 < N v < 10) redshifts are shown as the solid (open) 
points. The solid line indicates a = 0200 while the dashed line is 
the best- fit power law. The overall correlation is excellent, but there 
are outliers due to contamination. 



10" 10" 10" 10" 10" 




100 1000 



(j(km/s) 

Fig. 5. — The number of galaxies -/V„666 as a function of the 
estimated cluster velocity dispersion a for the synthetic data. Filled 
(open) points have Nv > 10 or (5 < N v < 10) velocities contributing 
to the dispersion estimate. The heavy solid line shows the true 
scaling, and the dashed lines show the best fit power law and its 
width as estimated from the variance of the points. 
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A = 1.32 and B — 0.98. With our standard sample we 
find A = 1.25 ± 0.03 and B = 0.89 ± 0.02. The error bars 
are small because of the large number of clusters (484) , so 
the fits formally disagree with the input relation by 3-4 
standard deviations. 

The uncertainties in the halo occupancy function are 
dominated by systematic errors arising from sample se- 
lection rather than statistical errors. For example, if we 
simply include all systems with cz < 15000 km/s, roughly 
the limit for detecting L„ galaxies, we find parameters 
(A = 1.20 ± 0.02 and B = 0.92 ± 0.02) which has a lower 
normalization, a steeper slope and more scatter about the 
mean relation. For systems with cz < 10000 km/s, we find 
A = 1.23±0.02 and B = 0.95±0.01. If we use a likelihood- 
dependent redshift cut such as cz < 10000 log(ln£) km/s, 
we find A = 1.26 ± 0.02 and B = 0.95 ± 0.01. We will 
adopt the N v > 5 sample as our standard from this point, 
and summarize the estimates of the occupancy function in 
Table g. 

In summary, we can recover the true halo occupancy 
function of the synthetic catalog reasonably well. The un- 
certainties are dominated by systematic errors at the level 
of 0.05 in the slope and 10% in the normalization of the 
relation. Some of these problems are due to the generation 
of the synthetic catalog rather than the search for clusters 
in the synthetic catalog. The FoF algorithm used to find 
clusters in the N-body simulation is prone to linking ob- 
jects which both the eye and our matched filter will regard 
as neighboring but separate objects. These then appear in 
our test as objects with high 'true' mass but low 'recov- 
ered' mass, possibly creating a modest bias towards flatter 
slopes and lower normalizations. While this aspect could 
be improved upon, we will leave exploration of 'unlinking' 
algorithms to future work. 

Unfortunately in the real data we do not know the in- 
trinsic masses of the halos and must use a surrogate esti- 
mate such as the cluster velocity dispersion, X-ray lumi- 
nosity or X-ray temperature. The surrogate we estimate 
as part of our algorithm is the cluster velocity dispersion 
tr, and in Fig. ^ we compare our estimated velocity disper- 
sions a based on Eq. ( p4| ) to the velocity dispersion of the 
dark matter particles o~2oo,dm inside fM200 of the cluster 
center. We select the sample for comparison with both 
our standard limits on the redshift (cz < 15000 km/s) 
and a minimum number N v > 5 of velocities to be in- 
cluded in the redshift estimate. Fit as a power law with 
(cr/10 3 km/s) = a(<j2oa,DM /10 3 km/s) b we find little offset, 
a = 0.93 ± 0.03, or slope difference, b = 1.04 ± 0.04. Re- 
stricting the analysis to systems with N v > 10 produces 
similar results, with a = 0.89 ± 0.05 and b = 0.94 ± 0.06. 
The scatter is consistent with the uncertainties in the ve- 
locity dispersion estimates. 

We can also estimate N*(a), as shown in Fig. 
simulations the velocity dispersion is related to 

by 
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(28) 



with a 10% dispersion about the power-law. The power 
law describing the input -/V»666(o"20o) relation has Nq = 
25.6 and x = 2.75 for 7V* 666 = A^cr/lO^m/sf. The ob- 
served relation, with our standard selection procedures, is 
nearly identical, with x = 2.76±0.08 and iV = 21.1±2.0, 



to the input relation. The uncertainties are large because 
both quantities have relatively large statistical uncertain- 
ties. These parameters change little as we adjust the sam- 
ple selection criteria. Alternatively, we can use Eq. ( pcf ) to 
transform from a to M20Q to estimate the occupation num- 
ber directly, with results very close to the input relation 
(see Table |). 

4.3. The Cluster Number Function 

Finally we examine how well we can recover the clus- 
ter number function dn/dN*666 of the synthetic catalog. 
Fig. ^ shows the distributions estimated using thre sholds 
of Afresh = 3, 5 and 10 in defining V max (see §3/7). The 
three estimates are mutually consistent, extending over 
two decades in JV*666- We can also compute the distri- 
bution for the (200ft.~ 1 Mpc) 3 N-body simulation volume 
from which our synthetic catalog is drawn. The charac- 
teristic volume for fair samples of the universe is roughly 
Vo = (lOOft^Mpc) 3 , which we shall use as a unit of 
volume. In these units the simulation has a volume of 
8Vq, and to generate the synthetic catalogs we periodi- 
cally replicate the simulation. The number function of 
low iV»666 clusters is derived from a small region of the 
simulation, while the number function of high N*666 clus- 
ters is drawn from several of the periodic repetitions of 
the data. There is also good agreement with the semi- 
analytic number function found by combining the the syn- 
thc tic ca talog's multiplicity function with the Jenkins et 



3.7 



al. (2001) CDM mass function following the procedures in 
(Eq. |25|) . The only significant difference is the absence 



of high 7X^666 clusters, but this is a real feature of the N- 




FlG. 6. — The number function dn/dN„eee for the synthetic data. 
The triangles, squares and pentagons show the results for Afresh = 
3, 5 and 10 and their statistical errors. The solid histogram shows 
the actual distribution in the (200/i — ^^Mpc) 3 N-body cube and its 
Poisson errors for the same bins. The smooth solid r-nr-^n shows 
the expected distribution based on the Jenkins et al. (2001) CDM 
mass function and the synthetic catalog's multiplicity function. The 
mass scale at the top is based on the true occupation function for 
the synthetic data. 
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body simulation used to produce the synthetic catalog. If 
we combine the mass function and the number function 
to estimate the occupancy function, we obtain mutually 
consistent estimates of iV*666(-M20o) f° r -^thresh = 3, 5 and 
10 which have slopes and normalizations slightly above 
(l-2cr) the true relation (see Table |J). 

5. 2MASS CLUSTERS 

With this understanding of the algorithm, we can now 
search for clusters in the real 2MASS data. We present no 
catalog at present pending a reanalysis of the full 2MASS 
data to a fainter limiting magnitude than is presently avail- 
able. Instead, we focus on the properties of the clusters. 
In any fitted relation, we have selected the cluster sam- 
ple used to perform the fit in exactly the same manner 
as was used for the synthetic catalog, and the order of 
the analyses parallels that used to test the algorithm on 
the synthetic catalog. Our standard sample consists of all 
galaxies identified fro m the 2MASS extended source cata- 
log (see Jarrett et al. 2000) with |6| > 5° to an extinction 
corrected magnitude limit of K< 12.25 mag. 

5.1. Tests of Completeness 

We start with two tests of the completeness of the algo- 
rithm for the real data. The first test compares the "shal- 
low" cluster catalogs generated using a bright subsamplc 
of the galaxy catalog, truncated at K< 11.25 mag, to the 
"deep" cluster catalogs generated using the complete sam- 
ple to K< 12.25 mag. The second test is to compare our 
cluster catalog to local X-ray surveys. 
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Fig. 7. — Detection of X-ray clusters as a function of redshift 
and X-ray flux fx- The solid points are X-ray clusters that were 
found and the open points are X-ray clusters which were missed. 
The squares show the clusters from the HIFLUGCS survey which is 
complete to fx = 2 X 10 — 11 ergs/cm 2 s (the horizontal line). The 
triangles show all clusters with X-ray flux measurements that we 
matched to our output catalog. The missed HIFLUGCS target at 
low redshift is the NGC 4636 group, and it is probably a failure in 
our matching methods rather than a true non-detection. 



5.1.1. Comparison of shallow and deep surveys 

We first discuss the results from comparing the shallow 
and deep cluster catalogs. With a one magnitude change in 
the survey depth, individual clusters will have significant 
changes in their membership, as a cluster which contained 
only L > (L > 0.1L*) galaxies in the bright subsample, 
would on average contain 3.4 (1.5) times as many galax- 
ies in the complete sample. Using the same termination 
criterion for the search, the shallow catalog contains only 
375 clusters, as compared to 1743 in the deep catalog. 

We first consider the reliability of the catalog by trying 
to identify shallow catalog clusters which have no coun- 
terpart in the deeper catalog. By cross-referencing the 
cluster assignments of the galaxies in the two catalogs we 
estimated a matching probability for identifying a cluster 
in the shallow catalog with each cluster in the deep cata- 
log. We included all galaxies with a 50% or higher cluster 
membership probability. The cluster matching probability 
can range from 0, if no galaxy assigned to a shallow cluster 
has been assigned to a deep cluster, to unity if every galaxy 
assigned to the shallow cluster is assigned to the same deep 
cluster. Only 4 of the 375 clusters lack a firm counterpart 
in the deeper catalog (defined by a matching probability 
< 50%), and an additional 8 may have ambiguous assign- 
ments (matching probability of 50-70%). Most shallow 
clusters have unambiguous counterparts, with 78%, 85% 
and 91% of the shallow clusters having match probabili- 
ties exceeding 95%, 90% and 85% respectively. The match 
probabilities are not unity because galaxies with modest 
membership probabilities can be dropped or assigned to 
a different cluster based on the deeper catalogs, reducing 
the match probability, even if the galaxies in the cluster 
cores remain perfectly matched. Thus, if we define a false 
positive as a cluster which will lack a clear counter part 
in a deeper survey, our method appears to have a false 
positive rate of 1-3% depending on the desired threshold. 
Be warned, however, that this false positive rate is not 
the same as the true false positive rates we considered for 
the synthetic data - a real but unvirialized overdensity of 
galaxies can be identified as a false positive in the syn- 
thetic data, but our deeper survey may still identify the 
clump as a cluster. 

Next we examine the relative completeness of the two 
catalogs. We divided the deep catalog into bins of -/V» 666 
and computed the fraction of the deep catalog clusters 
found by the shallow survey as a function of redshift. 
For each richness bin, the shallow survey is at least 80% 
complete until the redshift at which the least rich clus- 
ters in the bin are expected to have fewer than 3 galax- 
ies. For example, for the clusters in the deep catalog with 
1 < iV»666 < 3, corresponding to masses near 10 14 /i _1 M Q , 
the completeness of the shallow catalog is 92% (61%) for 
redshifts below that where an N*q$q = 1 (3) cluster is ex- 
pected to contain 3 galaxies. The deep catalog presumably 
has similar properties. 

The final comparison we make is to compare the esti- 
mates of A^*666 for the 363 well-matched clusters from the 
two catalogs. We find that 



log N^g™ 



(0.07±0.01)- 



(1.00±0.01)log< 6 c 6 c 6 p (29) 



with a variance of about the relation of 0.21 dex. Formally, 
the error estimates for N^qqq would have to be raised by 
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14% to make the variance consistent with the error esti- 
mates. The shallow catalog estimates of iV* 666 are higher 
by 17% on average (also found as the median difference). 
The offset is probably a real systematic effect - the shal- 
low catalog will preferentially include clusters with excess 
numbers of K < 11.25 mag galaxies because an upward 
Poisson fluctuation increases the likelihood of including 
the cluster in the catalog. The same cluster is unlikely to 
show the same upward fluctuation in the K < 12.25 mag 
catalog, producing a net bias. This interpretation pre- 
dicts that a comparison restricted to clusters in the shallow 
survey with more galaxies, corresponding to clusters with 
smaller uncertainties in -/V*666> should have less of a bias. 
For example, if we restrict the comparison to the 41 clus- 
ters in the shallow catalog with errors in log iV*666 smaller 
than 0.10, the average offset is reduced to 7% (0.03 ±0.02) 
from 17%. 

5.1.2. Comparison with X-ray catalogs 

The second test compares our complete catalog to X-ray 
selected samples. It is difficult to do so in a completely 
satisfactory manner because the selection methods are so 
different. We first compare our results to the 63 clus- 
ters in the HIFLUGCS X-ray flux limited sa mple (fx > 
2 x 10 -11 ergs/cm 2 s, Reiprich & Bohringer 20021 ). The 
HIFLUGCS survey covers most of the sky with \b\ > 20° 
plus small holes near the LMC, the SMC and the Virgo 
cluster. Fig. fj] shows that we find all HIFLUGCS clusters 
out to the redshifts where we expect even rich clusters 
to have too few galaxies to be detected by our algorithm 
(cz ~ 15000 km/s). Our automated matching system iden- 
tified counterparts to 97% (35 of 36) of the HIFLUGCS 
clusters with cz < 15000 km/s. We missed one very low 
redshift system, the NGC 4636 group at cz = 1320 km/s, 
but closer examination strongly suggests that this is a 
matching and coordinate problem rather than a genuine 
failure. Between 15000 km/s and 20000 km/s we find 
67% (8 of 12) of the HIFLUGCS clusters. We miss the 
lower temperature clusters ((Tx) = 4.3 keV) and find the 
higher temperature clusters ((Tx) = 6.6 keV). At higher 
redshifts, an increasing fraction of X-ray clusters are gen- 
uinely missed. Spot checks of these regions show only 0-2 
galaxies at the position and redshift of the X-ray cluster. 

We can also use our N*qqq estimates to estimate an X- 
ray flux (based on the correlation between iV*666 an d the 
X-ray luminosity in Eq. [56]). This leads to 18 (14 below 
cz = 15000 km/s) clusters whose -/V* 66 6 values and co- 
ordinates suggest the cluster should have an X-ray flux 
large enough to be included in HIFLUGCS. These flux 
estimates are not very accurate, but we show their distri- 
bution, in Fig. 0. They can be divided into three loose cat- 
egories. First, there are 7 very low redshift systems with 
cz < 2000 km/s. Some have very high likelihoods in our 
catalog (the two most prominent are the Eridanus group 
and the Ursa Major cluster which are the 26th and 38th 
clusters found), but no published X-ray studies. These sys- 
tems have virial sizes significantly larger than the ROSAT 
field of view. Second, there are 5 at intermediate red- 
shifts cz ~ 5000 km/s. The two most prominent sys- 
tems here, Abell S0805 and Abell 3574, are the 9th and 
17th clusters found. Finally, there are 6 systems above 
cz > 10000 km/s of which the most prominent are Abell 
1913 and Abell S0740. After considerable and time con- 



suming effort, we gave up on tracking down the origins 
of these objects. Many of the systems are well studied 
clusters with X-ray emission (e.g. Abell S0805) that do 
not appear in any of the ROSAT catalogs we have used 
for our comparisons. Some are systems contaminated by 
X-ray emission from AGN (e.g. Abell 3574). The very 
low redshift systems have angular extents comparable (or 
larger) than the ROSAT aperture. In short, understand- 
ing these systems in detail requires a careful analysis of 
the ROSAT data which is beyond the scope of our pr esent 
effort. We note, however, that Donahue et al. ( 2002 ) also 
find examples of optically detected clusters which appear 
to have fainter than expected X-ray counterparts. 

Fig. ^ also shows all other clusters for which we have 
found a ROSAT X-ray flux measurement but are not de- 
rived from complete, flux-limited samples. As these are 
largely derived from the RASS survey as well, they tend 
to have X-ray fluxes fx <; 2 x 10~ 12 ergs/cm 2 s. The 
automatic matches find 84% (178 of 213) of the clusters 
with X-ray flux measurements in our input local cluster 
catalog and cz < 15000 km/s. As with the HIFLUGCS 
sample, some of the mismatches at low redshift are due to 
cross-matching and coordinate problems. 

In summary, by comparing our standard cluster catalog 
to either a shallow catalog or X-ray cluster catalogs, we 
can show that we achieve very high completeness up to the 
redshifts where clusters contain too few galaxies for robust 
detection (about 3 galaxies). It is difficult, however, to use 
either comparison to robustly estimate a false positive rate. 
The comparison of the shallow and deep catalogs can only 
set a lower bound (of a few percent) on the false positive 
rate. The HIFLUGCS sample sets an upper bound of 40%, 
but this mainly reflects the limitations of the RASS and 
our ability to estimate X-ray fluxes from iV*666 rather than 
a true estimate of the false positive rate. 

5.2. Further Checks of the Velocity Dispersion 

Before proceeding to estimates of the halo occupancy 
function, we made three additional checks of our velocity 
dispersion estimates, comparing directly with other pub- 
lished values and through comparisons to X-ray observa- 
tions of the clusters. All fits of variable correlations are 
performed as \ 2 fits including the errors in both variables 
and including all clusters with at least N v > 5 associated 
redshifts that we could match to the other data. 

First we checked our velocity dispersions again st the ve- 
locity dispersion s o pu b from Girardi et al. (1998; Table 2) 
and Wu et al. ( 1999| ). We matched 98 clusters with at 
least N v > 5 redshift measurements to these previous es- 
timates, and we found good agreement within the rather 
large errors (Fig. |^). If we fit a power-law, our a is related 
to o-p U b by 



lOOOkm/s 



= (1.09 ±0.05) 



G pub 



1000km/s 



1.06±0.08 



(30) 



with a dispersion of 0.16 dex that is consistent with the 
uncertainties. If we restrict the fit to the 69 systems 
with N v > 10 redshift measurements, we find essentially 
the same parameters. Hence our membership probability 
weighting method for determining velocity dispersions is 
consistent with the more standard methods based on re- 
jecting outliers. 
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Fig. 8. — Comparisons between our velocity dispersion estimates 
and published estimates. Filled points have N v > 10 velocities and 
open points have 5 < N v < 10 velocities. The dashed lines show 
the best fit relation and its width, which would lie on the solid line 
if the agreement was perfect. 



We also find 173 (108) clusters with X-ray luminosities 
(temperatures) and at least N v > 5 gala xies with redshifts 
from from the sources summarized in §3.6. Figs. ^| and 
[To| show our estimates of the relations between the X-ray 
luminosity Lx and the X-ray gas temperature Tx to the 
velocity dispersion estimates. The correlation between the 
velocity dispersion and the X-ray luminosity, including er- 
rors in both quantities, is 



log L 44 = (0.20 ± 0.07) + (4.47 ± 0.24) log 



10 3 km/s 



( 31 ) 

where L44 = Lxft.^/10 44 ergs/s. The variance in the X-ray 
luminosity at fixed velocity dispersion is a (typically) large 
0.76 dex. If we restrict the fit to systems with N v > 10 the 
normalization of 0.18±0.08 is little changed, but the slope 
steepens to 4.69±0.28 with a reduced variance of 0.55 dex. 
The slope and normali zation are close to those found by 
Mulchaey & Zabludoff fll998| , zer o poi nt 0.48 ± 1.09, slope 

1+0.9 
-2.0> 



4.29±0.37) or Mahdavi & Geller (|200l| , zero point 0.40 



slope 4.4^Q'g). Wu et al. (1999) found a range of slopes 



from 2.56 ± 0.21 to 5.24 ± 0.29 depending on their fit- 
ting methodology The zero-point uncertainties wc cite 
for these published relations are overestimates because we 
had to correct the relations to a velocity scale 10 3 km/s 
from one of 1 km/s without knowing the full covariance 
matrix of the fit. For the 108 clusters with X-ray temper- 
atures, we find 



log 



N = (0.01±0.02)+(0.63±0.04) log f ) 

(32) 



10 3 km/s 

with a spread of 0.16 dex, which is close to the rela- 
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Fig. 9. — The velocity dispersion estimate a as a function of the 
X-ray luminosity Lx- Filled points have N v > 10 velocities and 
open points have 5 < N v < 10 velocities The light solid line is the 



correlation from Mulchaey & Zabludoff (1998) .and, the heavy solid 
line is the correlation from Mahdavi & Geller ( 2001 ). 
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Fig. 10. — The velocity dispersion estimate a as a function of the 
X-ray temperature Tx- Filled points have N v > 10 velocities and 
open points have 5 < N v <" 1 velocities. The light solid line is the 
fit from Wu, Fang & Xu (1998) with slope 0.56, the heavy solid line 
the fit with slope n fi7 The heavy dashed line is the estimate from 
Girardi et al. ( fl998| ). 
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tions found by Mulchaey & Zabludoff (1998, zero point 
-0.01 ±0.04, slope 0.51 ±0.05), Wu et al. t fiMJj , zero point 
-0.00 ± 0.03, slope 0.65 ± 0.02), and Girardi et al. ( |l998 , 
zero point -0.01 ± 0.03, slope 0.62 ± 0.04). Fitting to 
the 79 systems with N v > 10 gives the same parameters 
(0.01 ± 0.02 and 0.62 ± 0.04) with a reduced scatter of 
0.12 dex. The similarity with these published analyses sug- 
gests that our method for determining velocity dispersions 
is comparable to those used for other studies despite the 
small numbers of galaxies used in our dispersion estimates 
compared to these more extensive surveys of individual 
clusters. 

5.3. The Halo Multiplicity Function 

We now derive the halo multiplicity function based on 
the three observables, velocity dispersion, X-ray luminos- 
ity and X-ray temperature, which we can later relate to 
the cluster mass. We first derive the observed correlations 
and then estimate the conversion to the desired correlation 
with cluster mass. 

Fig. [ll shows that the observed relationship between 
velocity dispersion and galaxy number is very similar to 
that in the synthetic data (§^, Fig. ||). The best fit relation 
for the 939 systems with N v > 5, including the errors in 
both quantities, is 

log N*666 = (1-37 ± 0.03) + (2.63 ± 0.06) log(cr/10 3 km/s) 

(33) 

compared to 1.32 ± 0.04 and 2.76 ± 0.08 for the zero point 
and slope of the fits to the synthetic catalog (1.41 and 2.75 
for the input relation). Only one system is dropped from 
the fits. The scatter in N*qqq at fixed dispersion is slightly 
larger in the real data (0.53 dex versus 0.50 dex). If we 
restrict the fit to the 238 systems with N v > 10, we obtain 
consistent parameters (1.31 ± 0.04 and 2.60 ± 0.10) with a 
reduced scatter of 0.41 dex. While the significance of the 



fit in Eq. (|3^) is quite high, the very large scatter makes it 
hard to see by eye. Therefore as a check we also used the 
dispersion in the distribution of galaxy-group velocities in 
'stacked' clusters, binned in 5 logarithmic bins of iV*666j 
to make the trend more clearly visible. We obtained a fit 
very close to Eq. (|3^). 

Next we compare the halo occupation number to the 
X-ray properties of the clusters, as shown in Figs. O and 



13. We must face two problems in correlating N^qqq with 
the X-ray data. First, we find that the scatter in the cor- 
relations is significantly larger than is consistent with the 
uncertainties in either quantity. In part this is due to the 
presence of significant intrinsic scatter in the X-ray proper- 
ties of clusters. Second, there is considerable evidence for 
a break in the X-ray properties between groups and clus- 
ters, generally believed to be due to significant cooling and 
heating from star for mation processes in th e low er mass 
systems (David et al. 1993| ; Ponman et al 
Jones & Forman |l997| ;^ IIen fc Fabian [i"998 
19981 Arnaud & Evrard |1999|; Helsdon & Ponman &000 



199J; White, 
Markevitch 



Finoguenov, Reiprich & Bohringer 2001). Between the in- 
trinsic scatter and the break in the properties, the param- 
eters for power-law fits to X-ray correlations can depend 
on the fitting methods and the data used. 

In order to include the effects of these problems we 
made two modifications to our fitting procedures. First, 
we added a logarithmic systematic error (or intrinsic scat- 
ter) <7 sys in quadrature with the measurement errors for 
the fitted variables. We then adjusted a sys so that the fit 
has x 2 — Adof ■ Second, we fit power-law relations includ- 
ing the X-ray data only for the more massive clusters with 
T x > 1 keV and L x > 10 42 /i 2 ergs/s. This reduces the 
biases in the correlations from attempting to fit the lower 
mass systems where non-adiabatic effects begin to modify 
the X-ray properties while leaving enough systems to ob- 
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Fig. 11. — The number of galaxies iV*666 as a function of the 
cluster velocity dispersion a for the 2MASS data. Points and lines 
as in Fig. pi 
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Fig. 12. — The number of galaxies N* versus the X-ray tem- 
perature Tx- The dashed lines show the best power law fit for 
Tx > 1 keV and the average dispersion of the data about the fit. 
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tain a statistically significant fit. As in all our standard 
fits we include only the systems with N v > 5 associated 
redshifts. 

If we now find the best power-law relation between Lx 
and Tx, we find 



for er. 



0.24 and with a scatter of 0.33 dex. If we in- 



log L 4 4 = (0.21 ± 0.04) + (2.66 ± 0.12) log 



—) 

6keV/ 



(34) 



based on 84 clusters with cr sy s = 0.08 and a mean scatter of 
0.30 dex. The normalizatio n is co nsistent with our adopted 
standard from Markevitch ( 199S ) in Eq. (|l8|), but it has a 
significantly steeper slope. If we include no limits on the 
X-ray properties we find a similar zero-point of 0.28 ± 0.05 
but a still steeper slope of 3.04 ± 0.10 and a larger s catter 
of 0.35 dex. This fit matches that of Xue & Wu J2000[) 
under similar assumptions. There is clearly a break in the 
slope of the Lx~Tx relation near Tx = 1 keV which is 
incompatible with a single power law providing a good fit. 

We first fit the number-temperature relation as a power 
law using 84 clusters, as shown in Fig. [lj, to find that 

logAU 66 = (1.38±0.05) + (2.09±0.17)log(T/6keV). (35) 

with a sys = 0.12 dex and a mean scatter of 0.35 dex. This 
relationship changes little if we drop the restrictions on the 
X-ray properties. The fit for all 102 clusters with X-ray 
temperatures, luminosities and N v > 5, has a zero-point of 
1.39±0.05, a slope of 1.97±0.12, and a negligible increase 
in cr sys and the scatter. 

Fig. O shows the results for fitting the 153 clusters with 
L x > W 2 h- 2 ergs/s and N v > 5. We find 



log AT, 



666 



(1.29 ± 0.05) + (0.75 ± 0.05) logL 44 . (36) 
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Fig. 13. — The number of galaxies N t versus the X-ray luminosity 
Lx ■ The dashed curves show our best power-law fit to the relation 
for Lx > 10 42 /i 2 ergs/s. The solid curves show our theoretically 
estimated converpincL. (Eq. Ba ) of the A c ;-Lx correlation found by 
Donahue et al. ( [200 1| ) and the uncertainties in its normalization. 
We can match the two relations by raising the normalization of the 

conversion to A r i ~ 21A?f/f c . 



elude all 173 clusters with X-ray luminosity measurements 
and N v > 5, the zero-point becomes 1.23 ± 0.04 and the 
slope of 0.64±0.03 is significantly flatter. The distribution 
of clusters in Fig. [l3| is clearly inconsistent with a simple 
power law if we extend the fit to low X-ray luminosities, 
so we will not consider this case further. As a consistency 
check, we can combine the N^qq-Tx and N^qq-Lx rela- 
tions (Eqs. |36| and ^) to infer a zero-point of 0.12 ± 0.09 
and a slope of 2.79 ± 0.29 for the Lx~Tx relation which 
are consistent with the direct fit in Eq. (|3^). Similarly, if 
we combine the A^ge-cr and N* 6e6 -Tx relations (Eqs. |33| 
and |3E| ), we infer a zero-point of 0.00 ± 0.02 and a slope 
of 0.79 ± 0.07 for the a-Tx relation which are reasonably 
consistent with the direct fit in Eq. (j32]). 

We can also compare our results to those of Donahue et 
al. (2001), who compared the properties of clusters found 
in X-ray surveys with those found using the Postman et 
al. (1996) matched filter algorithm. In this algorithm, clus- 
ters are characterized by their total luminosity in units of 
L», A c ;, where we estimated a conversion of A c / = llA^g 
(see Eq. p2| ) between our cluster normalizations. Given 
this relation, we can convert their estimated correlation 
with X-ray luminosity into our units, 

log AU 66 = (1.8 ± 1.0) + (0.8 ± 0.2) log L 44 . (37) 

While the slope and the normalization are consistent with 
our estimate given the uncertainties in their relation, the 
normalization is systematically higher. If we estimate 
the conversion between A c / and N^qqq by matching the 
X-ray correlations for the two variables, we find that 
A c i — 21A^gH, where the slope is consistent with our the- 
oretical estimate (Eq. |22] ) but the normalization is nearly 
doubled. Since the sense of the change is the same as 
that needed to better match A c ; and ^666 for clusters of 
the same Abell richness class (see §3.6), we will adopt an 
empirical relation for the conversion of 



A,. 



21N. 



3/4 
*666 



(38) 



for subsequent comparisons. Using the original Lx and A c ; 
data points, we confirmed that this empirical conversion 
was correct, while the theoretical conversion from Eq. ( |22| ) 
showed an offset whose origin we do not fully understand. 

The final step in the analysis is to convert the ob- 
served occupancy functions (N^qqq (c) , ^,666 (Tx) and 
^♦666 (Lx)) into an estimate of the occupancy function 
A r *666(-^20o) as a function of the virial mass. The results, 
summarized in Table |[ are dominated by systematic errors 
associated with the choice of the mass scale. The average 
normalization A = 1.25 ± 0.03 and slope B = 1.02 ± 0.10 
found when we set the mass scale using the velocity dis- 
persion (<r(M2oo), Eq. ^8|) are lower than the average nor- 
malization A = 1.58±0.02 and slope B = 1.21±0.14 found 
when we set the mass scale using the X-ray temperatures 
(M 20 o(T x ), Eq. |l|). The results based on the same mass 
scale are largely independent of the observed occupancy 
function used to make the estimate. This is particularly 
true of the normalization A, where the scatter between the 
individual estimates is less than the formal errors, and less 
true of the slope B where the scatter between the individ- 
ual estimates is somewhat larger than the formal errors. 
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A shallower mass-temperature relation with T ~ M 1 / 2 
rather than T ~ M 2 / 3 , perhaps better suited to the mix 
of high and low temperature clusters we use, would bring 
the two estimates of the slopes into agreement. In addition 
to these systematic differences, recall that in the synthetic 
data our fits to the occupancy function underestimated the 
norm aliz ation by (19 ± 7)% and the slope by 0.06 ± 0.03 
(see SO and Table 0). 



5.4. The Cluster Number Function 

Next we estimate the cluster number function, dn/dN*666> 
for the 2MASS sample as shown in Fig. 113. The number 
function for the real sample extends to nigher values of 
iV* 666 than that for the synthetic sample (see Fig. ^) but 
otherwise looks very similar. The results for the three val- 
ues of -/Vthresh are again mutually consistent. Note that for 
large iV*666 the number function is systematically above 
that of the synthetic catalog. 

Comparing our results with previous estimates of the 
same quantity is quite difficult, due to different conven- 
tions for estimating 'richness'. The best we can do is to ap- 
ply reasonable transformations based on scaling relations 
between the widely differing definitions, realizing that this 
method is far from perfe ct. 

Donahue et al. (2001) and Postman et al. ( |2002 ) have 
estimated the density of clusters at intermediate redshift 



V aoo (h 'M 3 ) 




Fig. 14. — The number function dn/dN t §§§ for the 2MASS sam- 
ple. The open triangles, filled squares and open pentagons show the 
results for Afresh = 3,5 and 10. In many cases these points overlap. 
The bootstrap error estimates are shown only for JVthiesh = 5- The 
smooth curve is the number function expected for the synthetic cat- 
alog in Fig. N. The heavy solid line and the filled triangles show the 
dn/dh r{ num ber functions from Donahue et al. (2001) and Postman 
et al. (E002h respectively, converted to our units using the empirical 
conversion of Eq. (pq). The lower dashed curve cutting off sharply 
at high mas s show s the luminosity function from the NOG in Mari- 
noni et al. {EOOOJ) converted to a number function as described in 
the text. The other dashed curve shows the fit to tho Cf A survey 
luminosity function from Moore, Frenk & White ( 199c ) converted 
in the same way. The equivalent mass scale shown at the top af_thc 
figure is the conversion from iV*666 to M200 estimated in Eq. O. 



in terms of A c ; (see § 3J3 and Eq. |20|) over the ranges 20 <J 
A c ; ^ 100 and A c / J> 30 respectively. They fit the number 
density as a power law of the form 



dn 
dk c i 



Nn 



A e ; 

40 



(39) 



finding N = 6±f x lO-^Mpc" 3 with a = 5.3 ±0.5 and 
N = (1.55 ± 0.40) x 10- 6 /if 5 Mpc" 3 with a = 4.40 ± 0.30 
respectively. If we use the empirical conversion between 
A c i and A*666 derived from comparing the two richness 
estimates at fixed X-ray luminosity (Eq. |38| ), then we find 
good agreement for the number density of N^qqq ~ 10 
clusters (see Fig. ^|). If, however, we use our theoreti- 
cal estimate of the conver sion ( see §3.6, Eq. P2|), then the 



converted Donahue et al. ( |2001 ) and Postman et al. ( [2002 ) 
number densities are nearly an order of magnitude higher 
than our own estima tes. S ince both our catalogs (§5.1) 
and Donahue et al. (2001, 2002| ) have similar complete- 
ness compared to X-ray surveys, the need to use the em- 
pirical conversion in order to obtain similar number den- 
sities suggests that there is a problem with the theoretical 
conversion between the two richne ss est imates. 

Marinoni, Hudson & Giuricin (|200C ) used the Nearby 



Optical Galaxy (NOG, Marinoni 2001) sample to make a 



simila r estim ate based on the group catalogs of Giuricin 
et al. ( 2001 ). If Ag = L tot /L* is the estimate for the total 
group luminosity L to t in units of the luminosity of an L* 
galaxy, they derive a number function of 



dn 

dA^ A 



P.xp(-A ( , V A ( ; (1 ) (40) 

Ago, 



GO 



where ac — -1-4, no = 4.8/% x 10~ 4 Mpc , and 
Ago — 9.9. For a galaxy luminosity function with their 
estimated Schechter slope of a = —1.1, the equivalent 
number of L > L* galaxies is Nqo — Ago/5.1 = 1.9. For 
groups selected with a standard FoF algorithm, we expect 
A*666 — 0.66Ag, but the a pplicability of this conversion 
to the Marinoni et al. ( 2000| ) estimates is just a rough esti- 
mate. The number function dn/dN^e66 is identical to the 
Schechter function dn/dAc with Ag replaced by AT^ggg and 
Ago = 1-3. This estimate, which is also shown in Fig. [u], 
is a factor of 5 higher than ours for low N^qqq and cuts 
off more sharply. The sharper cutoff is simply a conse- 
quence of the cz < 6000 km/s redshift cutoff of the NOG 
Sample, which eliminates all the nearby rich clusters. The 
normalization could be brought into agreement with our 
estimate if we have overestimated the coefficient in the 
relation A* 666 ~ 0.66Nq used to convert the FoF mem- 
bership into our standard overdensity. Finally we have 
converted the luminosity function o f the CfA sample, as 
derived by Moore, Frenk & White ( {L993p , to dn/dN* 666 
in the same manner as described above and with the same 
difficulties. This is plotted as the other dashed curve which 
agrees with the NOG estimate at small Af„ 666 but has sig- 
nificantly more objects a t high A^,g66- As with the compar- 
ison to Donahue et al. (2001) and Postman et al. (2002), 
differences in the absolute number density are very sensi- 
tive to errors in the conversion between richness estimates, 
and we lack any means of doing this more prec isely given 
the informati on av ailable in Marinoni et al. (2000) and 
Moore et al. ( |1993| ). 
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Table || also gives the estimates for the halo occu pancy 
function needed to transform the Jenkins et al. ( |2001 ) 



mass function for our assumed cos molo gy into the ob- 
served number function (as in section 4.2). The results for 
-^thresh = 3, 5 and 10 are mutually consistent and inter- 
mediate to th e re sults from fitting the observed occupancy 
functions in |5.3| . 

5.5. Derived Quantities 

Finally we discuss the implied infrared mass-to-light ra- 
tio of clusters and the fraction of the local luminosity den- 
sity associated with the virialized regions of clusters. We 
start by adopting a standard model for the halo occupa- 
tion function. Our uncertainties in the occupation func- 
tion are dominated by systematic errors in how to relate 
the observed quantities to cluster masses. Depending on 
whether we set the mass scale using the velocity disper- 
sion, the X-ray temperature, or the cluster mass function, 
we find significant differences in both the normalization 
and the slope of the occupancy function (see Table ||) . We 
decided to simply average the results for the three possi- 
ble mass scales and use their dispersions as the standard 
errors: 

logAW; = (1.44±0.17)+(1.10±0.09)log(M 200 /i/10 15 M Q ). 

( 41 ) 

The resulting errors are larger than the statistical un- 
certainties for any given mass normalization method and 
somewhat larger than the systematic offsets we found 
when estimating the occupancy function of the synthetic 
catalog. The normalization is higher than we used in the 
synthetic catalog, which seems reasonable given the visibly 
weaker fingers of god in the synthetic catalogs. 
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Fig. 15. — The fraction j c i ust (> N*666)/ j total of all stellar lumi- 
nosity in systems with at least -/V„6gg galaxies. The open triangles, 
filled squares and open pentagons show the results for Afresh = 3, 
5 and 10. In many cases these points overlap. The bootstrap error 
bars are shown only for iVthrcsh = 5. The equivalent mass scale 
shown at the topxiLthe figure is the conversion from Af*666 to M200 
estimated in Eq. Ull 



From Eq. (^) we can estimate more traditional quan- 
tities like the K-band mass-to-light ratios of the clusters. 
Including the uncertainties in the luminosity function, the 
correction from aperture to total magnitudes, a nd a zero 
point of M p,K. = 3.39 mag (see Kochanek et al. 2001 and 
Cole et al. po6l|) we find that 



Mi 



L 



666 / k 



(116±46)ft 



Monnh 



10 15 M ( 



0.10±0.09 



(42) 



where almost all the uncertainty comes from the uncer- 
tainty in the normalization A — 1.44 ± 0.17 of the oc- 
cupancy function (it would be (116 ± I0)h for A = 1.44). 
This is nearly i dentic al to the baseline semi-analytic model 
of Cole et al. (2000) where the K-band mass-to-light ra- 



tio is 118(Af2ou7vT0 15 Mo) - 06 for clusters with masses 
above 10 13 ' 5 ft. _1 Af©. The mass-to-light ratio is essen- 
tially independent of mass, which differs so mewha t from 
estimates in the B-ba nd (e.g. Girardi et al. 2000 , Mari- 
noni & Hudson 2001) which suggest a steeper slope of 
(M/L)b oc m - 1 ^- 6 , albeit with significant uncertain- 
ties in the slopes. If there is a real slope difference, 
with (M I 'L)b I '{M I L)k oc M x then we can explain this 
as a trend in the colors of the galaxy population with 
A(B — K) = 5x ~ 0.5 to 1.5 mag over the range from 
10 13 M© to 10 15 M©. All general trends in cluster prop- 
erties have the sense required to produce the slope dif- 
ference. Starting from a constant K-band mass-to-light 
ratio, the more massive clusters could be dimmer in the 
B-band because of the increasing early-type galaxy frac- 
tion (a B-K color shift of roughly 0.5 mag from Sa to E), 
increasing metallicity (A(B-K) /[Fe/H] ~ 1.2 mag/dex) 
or increasing age (A(B-K)/logi ~ 0.5 to 1.0 mag/dex). 
Like the halo occupation number itself, it is important 
to match definitions when comparing our mass-to-light 
ratios to other estimates. Our estimate is normalized 
to be the ratios of the two quantities in spheres of ra- 
dius rpfQQQ ~ ^M200: leading to higher mass to light ra- 
tios than estimates comparing the light in cylind ers to 
the mass in spheres. For example, Rines et al. ( [2001 ) 
found {M/L) K = (75 ± 23) /i for Coma using the 2MASS 
survey, and the Kochanek et al. ( 2001 ) luminosity func- 
tion. At the rjV666 = l-5/i -1 Mpc, which also matches the 
Rines et al. (2001) estimate of the virial radius, we must 
raise their estimate of the mass to light ratio by 25% to 
(M/L)k = (92 ± 28) h to compare it to our estimates of 
the mass-to-light in spheres. This is quite close to our esti- 
mate of (M/L)k — (116±46)/i for the mass-to-light ratio 
of 1O 15 /i _1 M cluster like Coma in Eq. @. 

Since the K-band luminosity is well correlated with the 
total stellar mass, we can use the cluster number function 
to obtain a rough estimate of the fraction of stars in sys- 
tems above a given mass or number threshold. The total 
luminosity density of all galaxies is jtot = n*£*r[2 + a] 
while the equivalent luminosity density contained clusters 
with iV*666 > N (and in the region with r < tnqqq) is 
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dN^ m N, 
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(43) 



The ratio jciust/jtot is shown in Fig. [l5] as a function of 
A^*666- We find that the virialized regions of clusters more 
massive than 10 14 /i _1 M Q contain ~ 7% of the local lumi- 
nosity. This is quite comparable to the fraction of the mass 
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contained in such clusters in popular ACDM models with 
i7 mat ~ 0.3 and <7g ~ 1, though the number is particularly 
sensitive to the latter assumption. The enclosed fraction 
also depends on the density contrast to which we extend 
the cluster edges. For example, if we include galaxies out 
to An — 100, the luminosity fraction in clusters doubles. 

Since our data is consistent with N(M) ~ M we would 
predict that 2MASS galaxies provide good tracers of the 
mass with a small and relatively scale- independent bias on 
Mpc scales. This would suggest that the correlation func- 
tion (or power spectrum) of 2MASS galaxies should show 
the virial 'inflection' of the dark matter power spectrum 
on Mpc scales, where the growth of fluctuations is first en- 
hanced by non-linear infall and then suppressed by virial 
motions within halos. There is weak evide nce fo r this in 
Fig. 3 of Allgood, Blumenthal & Primack QZOOiP , though 
further work with the full catalog needs to be done to re- 
ally test this prediction. That N(M) ~ M is perhaps not 
unexpected, as both simulations and observations show 
that galaxies selected in the "red" trace the matter dis- 
tribution better than those selected in the "b lue" . The 
original semi-analytic models (Kauffman et al. 1999Q pre- 
dicted a steeper slope for N(M) for their red galaxies, with 
N(M) ~ M 9 , which is consistent with our results. 

Theory predicts, and our simulations have assumed, 
that the number of galaxies in a cluster mass halo should 
follow a Poisson distribution (at fixed mass). It is diffi- 
cult to test this with our current sample due to the dif- 
ficulty of estimating the mass of our clusters. The ratio 
of (N(N - 1))^ 2 /(N), where N is the number of mem- 
ber galaxies determined from the matched filter probabil- 
ities pi, is very close to unity for clusters binned by iV»666- 
(The Poisson prediction is (N(N - l)) 1 ' 2 = (AT).) How- 
ever Af*666 is not a good proxy for M200, having significant 
scatter. For the small number (110) of clusters for which 
we can detect all L„ galaxies and we have an X-ray tem- 
perature, the ratio (N(N — l)) 1 / 2 / [N) is also consistent 
with unity, though we need to choose broad bins in Tx 
and the statistics are poor. 

6. CONCLUSIONS 

We have applied the matched filter technique to both 
simulated galaxy catalogs and the 2MASS galaxy cata- 
logue to search for clusters over approximately 90% of the 
sky to a redshift limit of z ~ 0.05. We have matched our 
2MASS derived catalog to existing catalogs in an auto- 
mated way using the NASA Extragalactic Database. Our 
algorithm appears to be both robust and efficient, return- 
ing quite complete samples of clusters out to distances 
where the typical cluster contains as few as 3 galaxies. The 
algorithm finds almost no 'false groups', the main source 
of contamination being the inclusion of objects below the 
mass cut in the catalog. 

The matched filter algorithm gives us a way of estimat- 
ing cluster membership and hence cluster properties like 
the multiplicity function, the number function and the ve- 
locity dispersion in a new way. While our estimates for the 
velocity dispersion agree well with earlier work, we typi- 
cally have far fewer galaxies per cluster than dedicated 
surveys and hence larger errors. This is offset by the fact 
that we have a large number of clusters, over much of the 
sky, with which to search for correlations between cluster 
properties such as velocity dispersion and X-ray tempera- 



ture. Where there is overlap we find quite good agreement 
with earlier work, though often with improved statistics. 

Although it is necessary to be very careful about the 
definition of N in order to compare with theory, we find 
that with sufficient care we have been able to estimate the 
cluster number function, dn/dN, over more than 2 orders 
of magnitude in N. 

By using the velocity dispersion, X-ray luminosity or 
X-ray temperature as a surrogate for mass we are able to 
estimate the multiplicity function N(M). Although there 
are serious issues in converting observables into cluster 
masses, all of our results are fairly consistent and suggest 
that N(M) ~ M or perhaps slightly steeper, in reasonable 
agreement with earlier estimates. With all quantities nor- 
malized by the spherical radius corresponding to a mass 
overdensity of Am = 200 or the equivalent galaxy num- 
ber overdensity of A at = 20051^/ = 666, we find that the 
number of L > L* galaxies in a cluster of mass -A/200 is 

logA, 666 = (1.44±0.17) + (1.10±0.09)log(Af 20 oVl0 15 M Q ). 

(44) 

The uncertainties in this relation are largely due to the 
choice made for relating the observed quantities to the 
cluster mass scale. For a fixed mass scale the scatter 
resulting from the different observed correlations is con- 
siderably smaller. Correlations of N with other cluster 
properties, X-ray luminosity, temperature or galaxy ve- 
locity dispersion, are given in §^3. The region has a ri- 
band cluster mass-to-light ratio of (M/L)k — (116±46)ft 
which is essentially independent of cluster mass. The un- 
certainties are again dominated by the choice of the mass 
scale. This scaling is consistent with N(M) ~ M, though 
if we take our best fit seriously and N(M) is steeper 
than M we expect M/L would fall slowly with increas- 
ing mass. Integrating over all clusters more massive than 
= 10 14 H~ 1 Mq, the virialized regions of clusters con- 
tain 7% of the local stellar luminosity, quite comparable 
to the (somewhat theory dependent) mass fraction in such 
objects in currently popular ACDM models. 

The cluster likelihoods tend to be larger in the real data 
than our mock catalogs. This difference could have sev- 
eral sources. It could be due to the concentration of red- 
shift measurements in the real catalog towards groups and 
clusters rather than having the random distribution of the 
synthetic catalog. It may also indicate that the cosmol- 
ogy adopted in the underlying simulation is incorrect (the 
number of clusters is very sensitive to eg for example), 
the normalization of the input N(M) may be too low, 
our assumed N(M) may have too many galaxies in low- 
mass halos compared to high-mass halos, or the luminos- 
ity function could vary systematically with the parent halo 
mass rather than being fixed (as we have assumed). Along 
these lines, both the expectation that light closely trace 
mass in i"T-band and that the luminosity function change 
with mass suggest that clusters contain a larger fraction of 
galaxies than we have assumed in our current generation 
of mock catalogs. These avenues will be explored in the 
future with larger simulations and better data. 

We view this work as but the first step in an iterative 
sequence. Based on simulations which we know describe 
reality imperfectly, we have calibrated our cluster finding 
algorithm. When applied to the real data this algorithm 
allows us to estimate correlations between different prop- 
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erties of a halo which can be used in the next stage to 
improve the simulated catalogs. As the 2MASS catalog 
becomes increasingly complete, and more redshifts become 
available, we will estimate the global relations between dif- 
ferent halo parameters by providing them as priors to the 
fitting, and varying the parameters to optimize the global 
likelihood. These relations can then be used in the con- 
struction of the galaxy catalogs from improved simulations 
which will allow us to further optimize and understand the 
cluster finding algorithm. 
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APPENDIX 

INCLUDING COLOR INFORMATION 

In this appendix we outline the inclusion of color in- 
formation in our algorithm. To include color information 
in our algorithm we require a model for the multi-variate 
luminosity function in luminosity and colors, <p(M,Co), 
and its evolution with redshift. The multi-variate and 
standard luminosity functions are related by 4>(M) = 
J (f>(M,Co)dCo- Assuming that the numbers of galaxies 
are not evolving, the measured magnitudes and colors, m 
and c are related to the local values by M = m — T>(z) 
(Eq. 0) and C = c - C(M, z) to remove the effects of 
distance, K-corrections and evolution. 

For the 2MASS survey the description simplifies fur- 
ther because only the H-K color has a significant cor- 
relation with redshift and neither the H-K nor the J-H 
color has a significant correlation with luminosity. This 
means that we can factor the bivariate luminosity func- 
tion, 0(M,C o ) = 4>(M)£(C ), with f £(C )dC = 1. The 
color distribution, which consists of a narrow peak with a 
red tail, is well modeled by the sum of two Gaussians, 

£(H-K) = ^ c -{H-K-a )/2a 2 n | 1 ~ / & -{H-K-a x ) 

(Al) 



where / = 0.698, a = 0.226, a x = 0.268, a = 0.025 and 
o\ = 0.081. The galaxies become steadily redder at higher 
redshift, with 



C(z) = 2.333z- 0.770z 2 



(A2) 



and it appears broader at fainter magnitudes due to mea- 
surement errors that can be modeled by convolving the dis- 
tribution with a Gaussian of width a = 0.092 x 10 2(K - 13) . 
While the errors in the flux are unimportant to our analy- 
sis given the sample magnitude limit, the shallow slope of 
C(z) means that we must take into account the errors in 
the H-K colors. 

Given the bivariate luminosity function and the depen- 
dence of the colors on redshift , w e can mod ify the terms of 
the likelihood function (see §3.2 and §3.3) to include the 



additional information. The probability of finding a field 
galaxy with magnitude m„ color Cj and redshift z, is 

P f (mi,a,Zi) = OA\nlOD 2 c (z l )^^ f (m-V(z, i ))C(c, l ~C(z l )). 

(A3) 

Integrating over redshift, we obtain the number counts 
of galaxies with a given color, which is equivalent to the 
probability of finding a galaxy with magnitude m, and 
color Cj, 



Pfirrn 



Pf(mi,Ci,z)dz. 



(A4) 



The probability distribution for an unknown galaxy red- 
shift is simply the ratio Pf(nii, Ci, Zi)/Pf(rrii, c;). Because 
the slope of C is shallow, the addition of the H-K color in- 
formation only modestly improves the accuracy of redshift 
estimates. For the 50000 galaxies with known redshifts, 
the observed errors in the predicted redshifts are mod- 
estly smaller than their theoretical estimates of a z — 0.028 
(using only fluxes), 0.036 (using only colors) and 0.024 
(using both). The errors in redshifts estimated using col- 
ors (fluxes) grow slowly (linearly) with redshift, so at low 
redshifts fluxes always become better redshift estimators 
than colors. We must also modify the cluster membership 
probabilities, replacing <f> c (M) by <f> c (M,Co), while leav- 
ing the normalization factor in the denominator, $ C (M), 
unchanged. 

This formalism for including color information in a 
matched filter algorithm can be generalized to additional 
colors and more complicated descriptions for the evolution 
of colors with redshift and luminosity, although tabulated 
models will probably be required. Matched filters can also 
exploit the redder optical colors of cluste r galaxies (as used 
in other algorithms, e.g. Goto et al. 200l| ) by using a multi- 
variate cluster luminosity function with a different color 
distribution from that for the field. 
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